Finite-Volume Pionless Effective Field Theory for Few-Nucleon Systems with Differentiable Programming
Authors
Xiangkai Sun, William Detmold, Di Luo, Phiala E. Shanahan
Abstract
Finite-volume pionless effective field theory provides an efficient framework for the extrapolation of nuclear spectra and matrix elements calculated at finite volume in lattice QCD to infinite volume, and to nuclei with larger atomic number. In this work, it is demonstrated how this framework may be implemented via a set of correlated Gaussian wavefunctions optimised using differentiable programming and via solution of a generalised eigenvalue problem. This approach is shown to be significantly more efficient than a stochastic implementation of the variational method based on the same form of correlated Gaussian wavefunctions, yielding comparably accurate representations of the ground-state wavefunctions with an order of magnitude fewer terms. The efficiency of representation allows such calculations to be extended to larger systems than in previous work. The method is demonstrated through calculations of the binding energies of nuclei with atomic number $A\in\{2,3,4\}$ in finite volume, matched to lattice QCD calculations at quark masses corresponding to $m_π=806$ MeV, and infinite-volume effective field theory calculations of $A\in\{2,3,4,5,6\}$ systems based on this matching.
Concepts
The Big Picture
Imagine assembling a jigsaw puzzle where the pieces keep shifting shape. That’s roughly what nuclear physicists face when computing how protons and neutrons bind from first principles. Quantum Chromodynamics (QCD) describes quarks and gluons with exquisite precision, but its equations become impossibly tangled at the energy scales where nuclei form.
The standard workaround is to discretize space and time into a grid, a “lattice,” and simulate physics on supercomputers. Lattice calculations are confined to tiny artificial boxes, though, and computational cost explodes as you add more nucleons.
A new paper from MIT’s Center for Theoretical Physics and IAIFI offers a smarter path. Rather than brute-forcing larger lattice calculations, the researchers use a bridge theory called pionless effective field theory, a simplified description that captures essential nuclear forces without tracking every quark and gluon, to translate small lattice results into predictions for bigger nuclei. They made that bridge dramatically more efficient by replacing a slow, random search with a modern AI technique: differentiable programming.
The result: the same accuracy as previous methods with roughly ten times fewer terms, and nuclear predictions extended to systems previously out of reach.
Key Insight: By applying differentiable programming, the mathematical engine behind neural network training, to nuclear wavefunction optimization, this work squeezes an order-of-magnitude efficiency improvement from the same underlying physics framework.
How It Works
The strategy has two stages. First, run a lattice QCD calculation of a small nucleus inside an artificial box. Those results carry finite-volume artifacts, distortions caused by the fact that particles in a box behave differently than particles in infinite space. Second, deploy finite-volume pionless effective field theory (FVEFT) to model those same box calculations, pin down the theory’s free parameters (called low-energy constants, or LECs, the adjustable dials controlling how strongly nucleons attract each other), then extrapolate to infinite volume and larger nuclei.
The tricky part is step two. FVEFT requires finding the wavefunction, a mathematical description of where all particles are likely to be, that gives the lowest possible energy. The standard approach, the stochastic variational method (SVM), builds a trial wavefunction by randomly proposing Gaussian “blobs” one at a time, keeping each only if it improves the energy estimate. It works, but requires hundreds or thousands of terms.

The new approach flips the script. Differentiable programming writes down a compact wavefunction using a fixed number of correlated Gaussian terms (multi-dimensional bell curves encoding how particles influence each other’s positions), then computes how energy changes with respect to every parameter. Gradient descent simultaneously optimizes all parameters, steering the wavefunction toward the true ground state. It’s the same mathematics that trains a neural network, applied to nuclear wavefunctions.
The key steps:
- Parameterize the trial wavefunction as a sum of correlated Gaussians with adjustable widths, centers, and correlations.
- Compute the energy as a differentiable function of those parameters using quantum mechanical expectation values.
- Backpropagate: automatically calculate how each parameter nudges the energy, and update all parameters simultaneously via gradient descent.
- Stack multiple optimized states and solve a generalized eigenvalue problem (GEVP) to extract the most accurate energy levels from overlapping approximate solutions.

Where SVM needs hundreds of Gaussian terms to represent the helium-4 (⁴He) ground state, differentiable programming achieves comparable accuracy with roughly ten times fewer. That compactness matters. Memory and compute scale badly with term count, so a 10× reduction opens calculations that were previously impossible.

The team validated the method by computing binding energies for nuclei with A∈{2,3,4} inside finite-volume boxes, matching them to existing lattice QCD results at an artificially heavy pion mass of m_π = 806 MeV. With LECs pinned down, they extrapolated to infinite volume and pushed up to A=5 and A=6, systems not previously reached with this framework.
Why It Matters
Nuclear physics from first principles and AI-driven optimization beyond machine learning are both hard problems on their own. This paper tackles both at once. The differentiable programming tools behind PyTorch and JAX have already reshaped computer vision, language modeling, and protein structure prediction. Here they do the same for quantum few-body physics.
With the FVEFT matching procedure now extended to A=5 and A=6, physicists have a longer bridge from lattice QCD toward the nuclear chart. As lattice QCD approaches physical quark masses and its systematic uncertainties tighten, this matching framework will be ready to absorb those results and propagate them to larger nuclei. The same differentiable variational approach could also target nuclear matrix elements relevant to neutrinoless double-beta decay searches, or other few-body quantum systems where compact wavefunction representations are prized.
Bottom Line: Replacing random sampling with gradient-based optimization cuts nuclear wavefunction calculation costs by an order of magnitude, enabling first-principles-anchored predictions for A=5 and A=6 nuclei and opening a systematic bridge from lattice QCD to nuclear physics.
IAIFI Research Highlights
This work brings differentiable programming, a cornerstone of modern AI, into pionless nuclear effective field theory. Gradient-based optimization directly accelerates quantum many-body calculations that previously relied on stochastic methods.
Differentiable programming turns out to be more than a neural network training trick. Applied to a demanding physics problem, it delivers order-of-magnitude efficiency gains over stochastic variational methods, reinforcing its value as a general-purpose scientific optimization tool.
By extending finite-volume EFT calculations to A=5 and A=6 nuclei matched to lattice QCD, this research moves closer to the long-term goal of predicting nuclear properties directly from the Standard Model's quark-gluon dynamics.
Future work will target physical pion masses and larger nuclei as lattice QCD matures; the full methodology is detailed in [arXiv:2202.03530](https://arxiv.org/abs/2202.03530).
Original Paper Details
Finite-Volume Pionless Effective Field Theory for Few-Nucleon Systems with Differentiable Programming
[2202.03530](https://arxiv.org/abs/2202.03530)
["Xiangkai Sun", "William Detmold", "Di Luo", "Phiala E. Shanahan"]
Finite-volume pionless effective field theory provides an efficient framework for the extrapolation of nuclear spectra and matrix elements calculated at finite volume in lattice QCD to infinite volume, and to nuclei with larger atomic number. In this work, it is demonstrated how this framework may be implemented via a set of correlated Gaussian wavefunctions optimised using differentiable programming and via solution of a generalised eigenvalue problem. This approach is shown to be significantly more efficient than a stochastic implementation of the variational method based on the same form of correlated Gaussian wavefunctions, yielding comparably accurate representations of the ground-state wavefunctions with an order of magnitude fewer terms. The efficiency of representation allows such calculations to be extended to larger systems than in previous work. The method is demonstrated through calculations of the binding energies of nuclei with atomic number $A\in\{2,3,4\}$ in finite volume, matched to lattice QCD calculations at quark masses corresponding to $m_π=806$ MeV, and infinite-volume effective field theory calculations of $A\in\{2,3,4,5,6\}$ systems based on this matching.