Title:Energy Loss Functions for Physical Systems
Abstract:Effectively leveraging prior knowledge of a system’s physics is crucial for applications of machine learning to scientific domains. Previous approaches mostly focused on incorporating physical insights at the architectural level. In this paper, we propose a framework to leverage physical information directly into the loss function for prediction and generative modeling tasks on systems like molecules and spins. We derive energy loss functions assuming that each data sample is in thermal equilibrium with respect to an approximate energy landscape. By using the reverse KL divergence with a Boltzmann distribution around …
Title:Energy Loss Functions for Physical Systems
Abstract:Effectively leveraging prior knowledge of a system’s physics is crucial for applications of machine learning to scientific domains. Previous approaches mostly focused on incorporating physical insights at the architectural level. In this paper, we propose a framework to leverage physical information directly into the loss function for prediction and generative modeling tasks on systems like molecules and spins. We derive energy loss functions assuming that each data sample is in thermal equilibrium with respect to an approximate energy landscape. By using the reverse KL divergence with a Boltzmann distribution around the data, we obtain the loss as an energy difference between the data and the model predictions. This perspective also recasts traditional objectives like MSE as energy-based, but with a physically meaningless energy. In contrast, our formulation yields physically grounded loss functions with gradients that better align with valid configurations, while being architecture-agnostic and computationally efficient. The energy loss functions also inherently respect physical symmetries. We demonstrate our approach on molecular generation and spin ground-state prediction and report significant improvements over baselines.
| Comments: | 10 pages, 4 figures, NeurIPS 2025 |
| Subjects: | Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computational Physics (physics.comp-ph) |
| Cite as: | arXiv:2511.02087 [cs.LG] |
| (or arXiv:2511.02087v1 [cs.LG] for this version) | |
| https://doi.org/10.48550/arXiv.2511.02087 arXiv-issued DOI via DataCite |
Submission history
From: Sékou-Oumar Kaba [view email] [v1] Mon, 3 Nov 2025 21:58:36 UTC (1,061 KB)