Solving Schrodinger equations using a physically constrained neural network

Kai-Fang Pu; Han-Lin Li; Hong-Liang Lü; Long-Gang Pang

doi:10.1088/1674-1137/acc518

Chinese Physics C> 2023, Vol. 47> Issue(5) : 054104 DOI: 10.1088/1674-1137/acc518

Solving Schrodinger equations using a physically constrained neural network

1.
College of Science, Wuhan University of Science and Technology, Wuhan 430065, China
2.
HiSilicon Research Department, Huawei Technologies Co., Ltd., Shenzhen 518000, China
3.
Key Laboratory of Quark and Lepton Physics (MOE) and Institute of Particle Physics, Central China Normal University, Wuhan 430079, China

Abstract
HTML
Reference
Related

PDF

Abstract：
Deep neural networks (DNNs) and auto differentiation have been widely used in computational physics to solve variational problems. When a DNN is used to represent the wave function and solve quantum many-body problems using variational optimization, various physical constraints have to be injected into the neural network by construction to increase the data and learning efficiency. We build the unitary constraint to the variational wave function using a monotonic neural network to represent the cumulative distribution function (CDF) $F(x) = \int_{-\infty}^{x} \psi^*\psi {\rm d}x'$. Using this constrained neural network to represent the variational wave function, we solve Schrodinger equations using auto-differentiation and stochastic gradient descent (SGD) by minimizing the violation of the trial wave function $ \psi(x) $ to the Schrodinger equation. For several classical problems in quantum mechanics, we obtain their ground state wave function and energy with very low errors. The method developed in the present paper may pave a new way for solving nuclear many-body problems in the future.
- deep neural network ,
- auto differentiation ,
- variational problems ,
- the cumulative distribution function ,
- ground state wave function

References

[1]	G. V. Cybenko, Mathematics of Control, Signals and Systems 2, 303 (1989) doi: 10.1007/BF02551274
[2]	A. Boehnlein et al., Rev. Mod. Phys. 94, 031003 (2022) doi: 10.1103/RevModPhys.94.031003
[3]	D. Saad, American Scientist 92, 578 (2004)
[4]	P. Mehta, M. Bukov, C.-H. Wang et al., Physics reports 810, 1 (2019) doi: 10.1016/j.physrep.2019.03.001
[5]	E. M. Nordhagen, J. M. Kim, B. Fore et al., arXiv: 2210.00365
[6]	B. R. Barrett, P. Navrátil, and J. P. Vary, Progress in Particle and Nuclear Physics 69, 131 (2013) doi: 10.1016/j.ppnp.2012.10.003
[7]	G. Torlai, G. Mazzola, J. Carrasquilla et al., Nature Physics 14, 447 (2018) doi: 10.1038/s41567-018-0048-5
[8]	C. Adams, G. Carleo, A. Lovato et al., Phys. Rev. Lett. 127, 022502 (2021) doi: 10.1103/PhysRevLett.127.022502
[9]	D. Pfau, J. S. Spencer, A. G. D. G. Matthews, and W. M. C. Foulkes, Phys. Rev. Res. 2, 033429 (2020) doi: 10.1103/PhysRevResearch.2.033429
[10]	M. Ruggeri, S. Moroni, and M. Holzmann, Phys. Rev. Lett. 120, 205302 (2018) doi: 10.1103/PhysRevLett.120.205302
[11]	J. Han, L. Zhang, and E. Weinan, Journal of Computational Physics 399, 108929 (2019) doi: 10.1016/j.jcp.2019.108929
[12]	S. Shi, K. Zhou, J. Zhao, S. Mukherjee, and P. Zhuang, Phys. Rev. D 105, 014017 (2022) doi: 10.1103/PhysRevD.105.014017
[13]	K. Choo, A. Mezzacapo, and G. Carleo, Nature communications 11, 2368 (2020) doi: 10.1038/s41467-020-15724-9
[14]	M. Scherbela, R. Reisenhofer, L. Gerard et al., Nature Computational Science 2, 331 (2022) doi: 10.1038/s43588-022-00228-x
[15]	Y. Yang and P. Zhao, arXiv: 2211.13998
[16]	R. P. Feynman, Rev. Mod. Phys. 20, 367 (1948) doi: 10.1103/RevModPhys.20.367
[17]	S. Chen, O. Savchuk, S. Zheng et al., Phys. Rev. D 107, 056001 (2023) doi: 10.1103/PhysRevD.107.056001
[18]	Y. Che, C. Gneiting, and F. Nori, Phys. Rev. B 105, 214205 (2022) doi: 10.1103/PhysRevB.105.214205
[19]	M. Raissi, P. Perdikaris, and G. E. Karniadakis, arXiv: 1711.10561
[20]	M. Raissi, P. Perdikaris, and G. E. Karniadakis, Journal of Computational physics 378, 686 (2019) doi: 10.1016/j.jcp.2018.10.045
[21]	E. Haghighat, M. Raissi, A. Moure et al., Computer Methods in Applied Mechanics and Engineering 379, 113741 (2021) doi: 10.1016/j.cma.2021.113741
[22]	J. Hendriks, C. Jidling, A. Wills et al., arXiv: 2002.01600
[23]	J. Hermann, Z. Schätzle, and F. Nóe, Nature Chemistry 12, 891 (2020) doi: 10.1038/s41557-020-0544-y
[24]	P. Hohenberg and W. Kohn, Phys. Rev. 136, B864 (1964) doi: 10.1103/PhysRev.136.B864
[25]	W. Kohn and L. J. Sham, Phys. Rev. 140, A1133 (1965) doi: 10.1103/PhysRev.140.A1133
[26]	M. S. Badar, S. Shamsi, J. Ahmed et al., Molecular dynamics simulations: concept, methods, and applications, in Transdisciplinarity (Springer, 2022), p. 131
[27]	D. Luo, G. Carleo, B. K. Clark et al., Phys. Rev. Lett. 127, 276402 (2021) doi: 10.1103/PhysRevLett.127.276402
[28]	H. J. Rothe, Lattice gauge theories: an introduction (Singapore: World Scientific Publishing Company, 2012), p. 628
[29]	R. Abbott, M. S. Albergo, A. Botev et al., arXiv: 2208.03832
[30]	J. Keeble and A. Rios, Phys. Lett. B 809, 135743 (2020) doi: 10.1016/j.physletb.2020.135743
[31]	H. Saito, Journal of the Physical Society of Japan 87, 074002 (2018) doi: 10.7566/JPSJ.87.074002
[32]	C. Giuseppe and M. Troyer, Science 355, 602 (2017) doi: 10.1126/science.aag2302
[33]	A. Paszke, S. Gross, S. Chintala et al., Automatic differentiation in pytorch (2017)
[34]	X. Glorot and Y. Bengio, Understanding the difficulty of training deep feedforward neural networks, in Proceedings of the thirteenth international conference on artificial intelligence and statistics (JMLR Workshop and Conference Proceedings, 2010), p. 249
[35]	I. Senitzky, Phys. Rev. 124, 642 (1961) doi: 10.1103/PhysRev.124.642
[36]	M. Capak and B. Gönül, Modern Physics Letters A 31, 1650134 (2016) doi: 10.1142/S0217732316501340
[37]	R. L. Karandikar, Sadhana 31, 81 (2006) doi: 10.1007/BF02719775
[38]	M. Abadi, A. Agarwal, P. Barham et al., arXiv: 1603.04467
[39]	D. P. Kingma and J. Ba, arXiv: 1412.6980

[1]	M.A. Matveev , A.T. Sitnikov , A.V. Sarantsev . Tensor formalism for the partial wave analysis of reactions with resonances decaying into four pseudoscalar mesons. Chinese Physics C, 2026, 50(3): 1-12.
[2]	Yongcheng Wu , Liang Xiao , Yan Zhang . Deep Learning to Improve the Sensitivity of Higgs Pair Searches in the 4b Channel at the LHC. Chinese Physics C, 2026, 50(3): 1-15.
[3]	Li Tang , Liang Liu , Ying Wu . Null test of cosmic curvature using deep learning method. Chinese Physics C, 2026, 50(1): 015107. doi: 10.1088/1674-1137/ae0b42
[4]	Renli Xu , Chen Wu , Jian Liu , Bin Hong , Jie Peng , Xiong Li , Ruxian Zhu , Zhizhen Zhao , Zhongzhou Ren . Ground-state properties of finite nuclei in relativistic Hartree-Bogoliubov theory with an improved quark mass density-dependent model. Chinese Physics C, 2026, 50(1): 014105. doi: 10.1088/1674-1137/ae0997
[5]	Jing-Juan Qi , Zhen-Yang Wang , Zhu-Feng Zhang , Xin-Heng Guo . The properties of the S-wave D_sD_s bound state. Chinese Physics C, 2026, 50(2): 1-8. doi: 10.1088/1674-1137/ae1195
[6]	Tao Li , Min Liu , Ning Wang . Proton separation energy predictions for proton-rich nuclei with the radial basis function approach and mirror symmetry. Chinese Physics C, 2026, 50(3): 1-6.
[7]	Wentao Zeng , Zehao Lin , Yiran Wang , Shuangquan Zhang , Jinniu Hu , Ying Zhang . Resonant states in the Schrödinger equation solved by the Green's function method. Chinese Physics C, 2026, 50(2): 1-10.
[8]	Muhammad Farhan Taseer , Subhash Singha . Impact of particle production mechanisms on pseudorapidity distribution and directed flow in Au+Au and Cu+Cu collisions at ${ \sqrt{{\boldsymbol s}_{\boldsymbol{ NN}}}}$ = 19.6 GeV using AMPT model. Chinese Physics C, 2025, 49(10): 104101. doi: 10.1088/1674-1137/ade660
[9]	Xiaobin Wang , Lei Chang . Unveiling the inner structure of the Pion’s first excited state. Chinese Physics C, 2025, 49(12): 123107. doi: 10.1088/1674-1137/adfc34
[10]	Wenchang Xiang , Yuanyuan Hu , Yanbing Cai , Mengliang Wang , Daicui Zhou . On possible implications of the exponential distribution of constituent quarks within proton at high energies. Chinese Physics C, 2025, 49(12): 124110. doi: 10.1088/1674-1137/adfc35
[11]	Mudassar Ahmed , Abdul Kabir , Jameel-Un Nabi , Laiba Hamid , Manzoor Ahmad . Bayesian-optimized CatBoost for Ground-State Nuclear Charge-Radius Prediction. Chinese Physics C, 2025, 50(3): 1-13.
[12]	Shuang Qu , Jin-Yan Zhang , Man Bao . Nuclear mass predictions with a Bayesian neural network. Chinese Physics C, 2025, 49(10): 104106. doi: 10.1088/1674-1137/ade958
[13]	Jialei Wei , Ao Liu , Dejiang Li , Cuihong Wen . Physical parameter regression from black hole images using a multiscale adaptive neural network. Chinese Physics C, 2025, 49(12): 125105. doi: 10.1088/1674-1137/adf542
[14]	LIANG Jun , LIU Yan-Chun , ZHU Qiao . Thermodynamics of noncommutative geometry inspired black holes based on Maxwell-Boltzmann smeared mass distribution. Chinese Physics C, 2014, 38(2): 025101. doi: 10.1088/1674-1137/38/2/025101
[15]	ZHANG Cong , HE Yuan , ZHAO Hong-Wei , ZHANG Sheng-Hu . Multipacting simulation and analysis of a taper quarter wave cavity by using Analyst-PT3P. Chinese Physics C, 2012, 36(4): 362-366. doi: 10.1088/1674-1137/36/4/012
[16]	GUO Yan-Qing , SONG Jie . Quantitative conditions for the formation of p-wave neutron halos. Chinese Physics C, 2011, 35(2): 158-162. doi: 10.1088/1674-1137/35/2/010
[17]	YANG Hong-Xun (for the BESⅡ collaboration) . Partial wave analysis of J/ψ→ppπ⁰ and measurement of J/ψ→ppη, ppη´. Chinese Physics C, 2009, 33(12): 1331-1335. doi: 10.1088/1674-1137/33/12/048
[18]	ZHANG Da-Lin , QIU Sui-Zheng , LIU Chang-Liang , SU Guang-Hui . Steady state investigation on neutronics of a molten salt reactor considering the flow effect of fuel salt. Chinese Physics C, 2008, 32(8): 624-628. doi: 10.1088/1674-1137/32/8/007
[19]	Yu Hong , Shen Qixing , Zhang Lin . Angular Distribution of Process e⁺e^－→τ⁺τ^－,τ^－→a₁υτ,a₁→ρπ and Characteristics of a₁ Meson. Chinese Physics C, 1994, 18(7): 583-590.
[20]	WU Zhong-Li . ANALYSIS OF THE MOMENTUM DISTRIBUTION WIDTH OF THE PREOJECTILE-LIKE-FRAGMENT FROM HEAVY ION COLLISIONS. Chinese Physics C, 1989, 13(8): 752-754.

Access

Figures(4) / Tables(1)

Get Citation

Kai-Fang Pu, Han-Lin Li, Hong-Liang Lü and Long-Gang Pang. Solving Schrodinger equations using physically constrained neural network[J]. Chinese Physics C. doi: 10.1088/1674-1137/acc518

Kai-Fang Pu, Han-Lin Li, Hong-Liang Lü and Long-Gang Pang. Solving Schrodinger equations using physically constrained neural network[J]. Chinese Physics C. doi: 10.1088/1674-1137/acc518 shu

RIS(for EndNote,Reference Manager,ProCite)

BibTex

Txt

Milestone

Received: 2023-01-01

Article Metric

Article Views(3682)
PDF Downloads(65)
Cited by(0)

Policy on re-use

To reuse of subscription content published by CPC, the users need to request permission from CPC, unless the content was published under an Open Access license which automatically permits that type of reuse.

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

HTML

I. INTRODUCTION

The universal approximation theorem of the deep neural network (DNN) [1] makes it powerful for representing a variational function $ y = f(x, \theta) $ with trainable parameters θ. In physics, this function can be used as solution of many different partial differential equations (PDEs) $ \hat{L} f = 0 $, such as Maxwell equations in the electromagnetic field, Navier-Stokes equations in fluid dynamics, Schrodinger equations in quantum mechanics, and Einstein field equations for gravity. The traditional way to solve this problem is to use physical models. These models face great challenges in solving inverse problems with complex geometric regions and high-dimensional space. Unlike these models, the deep learning method developed in this study provides a new direction to solve these problems. As the parameters of a DNN are initialized with random numbers, the variational function $ f(x, \theta) $ violates the PDEs, and the residuals $ \delta = |\hat{L} f| $ are usually the optimization objectives that can be minimized to the desired precision. In this way, many physical problems [2] are naturally mapped into optimization problems [3] that can be solved using the modern deep learning libraries.

The main advantages of machine learning are that (1) it directly establishes the function mapping between input and output data, and (2) ordinary differential equations (ODEs) and PDEs can be transformed into variational problems that can be solved using optimization. Machine learning can be helpful in finding low-dimensional manifolds in a high-dimensional space, which is crucial for the quantum many-body problem, which suffers from the curse of dimensionality. The associated disadvantage is that it is at an early stage of development and its applicability to computational physics has not been fully tested.

With strong information encapsulation capability, deep learning has been proved to be a powerful tool in solving quantum many-body problems [4–8]. The most typical application is to use the DNN to represent the wave function of quantum many-body states for many-electron systems [9]. In subsequent developments, artificial neural network (ANN) applications were extended to prototypical spin lattice systems and quantum systems in a continuous space [10–12]. Recently, machine learning has been used to deal with ab-initio problems [13–15]. The Feynman path integral [16] is another method for solving quantum state problems. Modern generative models can represent a probability distribution with high computational efficiency. A Fourier-flow generative model has been proposed to simulate the Feynman propagator and generate paths for quantum systems [17]. Further, Ref. [18] proposed a Feynman path generator that can estimate the Euclidean propagator and the ground state wave function with high accuracy.

PDEs usually have boundary and/or initial conditions. In an early study, these initial and boundary conditions were built into the neural network by construction, and the training objective was to minimize the residual δ alone. This method uses hard constraints such that $ f(x, \theta) $ satisfies the initial and boundary conditions automatically. It is thus quite data efficient. The recent physics informed neural network [19–21] uses soft constraints where the violations to initial and boundary conditions are also added to the training objective $L = |\hat{L} f| + \beta_1 |\delta_{BC}| + \beta_2 |\delta_{IC}|$.

Some variational functions should obey physical constraints. For example, in solving the Maxwell equations, the magnetic field represented by the DNN should be divergence free. To include this constraint, the paper "Linearly constrained neural network" proposes a DNN that produces a vector field $ \vec{A}(x, y, z, \theta) $ whose curl $ \nabla \times \vec{A} $ is divergence free [22]. It is thus also possible to construct a scalar field $ \phi(x, y, z, \theta) $ whose gradients $ (\partial_x \phi, \partial_y \phi, \partial_z \phi) $ are curl free. Actually, a general method has been developed to construct neural networks with linear constraints. In solving the many-body Schrodinger equations, the many fermion wave function should be anti-symmetric. FermiNet and PauliNet use the Slater determinant to construct DNNs that are anti-symmetric. [23] In DFT [24–26] and molecular dynamics [26], the local chemical environment usually has translational or rotational symmetry that is considered using a gauge equivalent neural network [27]. In the lattice gauge field theory [28], gauge equivariant normalizing flows are employed to sample field configurations [29].

In the present work, we use a monotonic neural network to represent the cumulative distribution function $\int_{-\infty}^{x} f(x') {\rm d}x'$, whose first order derivative is the probability density $ f(x) = \psi^*(x)\psi(x) $ that gives the ground state wave function. The present paper demonstrates that a neural network with physical constraints can be used as efficient trial wave functions of Schrodinger equations. Auto-diff helps to compute the required derivatives of the trial function with respect to the input variables. In this way, optimizing the violation of the trial function to PDEs allows solving the PDEs with high accuracy. Compared to previous methods, our method does not need to calculate any numerical integrals in the whole calculation and the unitary constraint we impose on the variational wave function increases the data learning efficiency. The improved algorithm greatly reduces the amount of computation required to solve the same Schrodinger equation. These advantages make our method more suitable for dealing with many-body states, which require a huge amount of computation.

IV. CONCLUSIONS

In the present study, we used a physics-based neural network to solve Schrodinger equations numerically. We designed a monotonic neural network to represent the CDF of the ground state wave function. In this way, the wave function represented by the DNN satisfies the normalization condition by design. The variational optimization is reduced to an optimization problem by minimizing the violation of the trial wave function and trial ground state energy $ E_0 $ to Schrodinger equations. The method is used to solve Schrodinger equations with three different potentials, the harmonic oscillator, the Woods-Saxon potential, and the infinitely high potential well, all with a small relative error.

Compared to traditional variational methods in solving quantum mechanical problems, the trial wave function represented by the DNN does not have fixed function forms before training. The training objective is different from the traditional $ E_0 = \dfrac{\langle \psi | H | \psi \rangle}{\langle \psi | \psi \rangle } $, where numerical integration is required for both the numerator and denominator. In our case, the objective is to minimize the violation to the Schrodinger equation on sampled spatial coordinates. As the neural network is constrained, the trial wave function is normalized by construction. Our method is also different from the previous Schrodinger equation solver using supervised learning, where ground state energies from numerical solutions are needed to train the neural network. In another DNN Schrodinger solver [30, 31], the initial values of the network parameters greatly affect the optimization results. To avoid strong fluctuations, they provide a trial wave function whose form is close to the exact solution. The disadvantage of the previous algorithms is that they can only solve problems in which the form of the exact solution of the equation is known. Our algorithm can directly ignore the pre-training process, so we do not need to know any information of the exact solution before training. This is more universal and provides the possibility to solve problems that have never been dealt with before. In addition, we observe that our DNN can approximate the ground state wave function with fewer trainable parameters. Moreover, the physical constraints constructed in the neural network make the current method quite data efficient. Thus, we can achieve higher accuracy with less computation.

The current method can be improved in several ways. First, the CDF works for wave functions in high dimensional space as long as the n-dim spatial coordinates are flattened. Second, the spatial coordinates used for training can be sampled using the learned wave function or through active learning, to increase the training efficiency. Third, the anti-symmetric constraints of the wave function should be considered for many fermion systems. Although further efforts have to be done to improve the current method, it shows good properties in solving classical quantum mechanical problems. The next step is to solve the ground state energy and wave functions of the deuteron. It also paves a new way in solving many nucleon problems.

ACKNOWLEDGMENTS

LG Pang and KF Pu acknowledge the support provided by Huawei Technologies Co., Ltd. The contributions of Dr. Hong-Liang Lü are non-Huawei achievements. The computations were performed at the NSC3 super cluster at CCNU and High-Performance Computing Center of Wuhan University of Science and Technology.

Reference (39)

$N_{\rm unit}N_{\rm layer}$	1	2	3	4
4	0.9995717	0.9999767	0.9999705	0.9999618
8	0.9999416	0.9999797	0.9999910	0.9999932
16	0.9999861	0.9999923	0.9999936	0.9999967
32	0.9999789	0.9999909	0.9999896	0.9999922
64	0.9999744	0.9999746	0.9999903	0.9999941

Solving Schrodinger equations using a physically constrained neural network

Abstract：

References

Access

Article Metrics

Metrics

通讯作者: 陈斌, bchen63@163.com

Email This Article

Solving Schrodinger equations using a physically constrained neural network

HTML

目录