Building imaginary-time thermal field theory with artificial neural networks

Tian Xu; Lingxiao Wang; Lianyi He; Kai Zhou; Yin Jiang

doi:10.1088/1674-1137/ad5f80

Chinese Physics C> 2024, Vol. 48> Issue(10) : 103101 DOI: 10.1088/1674-1137/ad5f80

Building imaginary-time thermal field theory with artificial neural networks

Tian Xu ¹ ,
Lingxiao Wang ^2,5, ,
Lianyi He ^3, ,
Kai Zhou ^4,5 ,
Yin Jiang ^{1,,

,}

1.
Physics Department, Beihang University, 37 Xueyuan Rd, Beijing 100191, China
2.
Shanghai Research Center for Theoretical Nuclear Physics, National Natural Science Foundation of China, Fudan University, Shanghai 200438, China
3.
Department of Physics, Tsinghua University, Beijing 100084, China
4.
School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), Shenzhen 518172, China
5.
Frankfurt Institute for Advanced Studies, Ruth-Moufang-Str. 1, 60438 Frankfurt am Main, Germany

Abstract
HTML
Reference
Related

PDF

Abstract：
In this paper, we introduce a novel approach in quantum field theories to estimate actions using artificial neural networks (ANNs). The actions are estimated by learning system configurations governed by the Boltzmann factor, $ e^{-S} $, at different temperatures within the imaginary time formalism of thermal field theory. Specifically, we focus on the 0+1 dimensional quantum field with kink/anti-kink configurations to demonstrate the feasibility of the method. Continuous-mixture autoregressive networks (CANs) enable the construction of accurate effective actions with tractable probability density estimation. Our numerical results demonstrate that this methodology not only facilitates the construction of effective actions at specified temperatures but also adeptly estimates the action at intermediate temperatures using data from both lower and higher temperature ensembles. This capability is especially valuable for detailed exploration of phase diagrams.
- artificial neural network ,
- imaginary-time thermal field theory ,
- effective model ,
- probability estimation

References

[1]	S. Muroya, A. Nakamura, C. Nonaka et al., Prog. Theor. Phys. 110, 615 (2003), arXiv: hep-lat/0306031 doi: 10.1143/PTP.110.615
[2]	C. Ratti, Rept. Prog. Phys. 81(8), 084301 (2018), arXiv: 1804.07810[hep-lat] doi: 10.1088/1361-6633/aabb97
[3]	S. Durr et al. (BMW), Science 322, 1224 (2008), arXiv: 0906.3599[hep-lat] doi: 10.1126/science.1163233
[4]	E. Ballini, G. Clemente, M. D'Elia et al., PoS LATTICE2023, 224 (2024) doi: 10.22323/1.453.0224
[5]	J. Greensite, Lect. Notes Phys. 821, 1 (2011) doi: 10.1007/978-3-642-14382-3
[6]	J. Bardeen, L. N. Cooper, and J. R. Schrieffer, Phys. Rev. 108, 1175 (1957) doi: 10.1103/PhysRev.108.1175
[7]	J. M. Kosterlitz and D. J. Thouless, J. Phys. C 6, 1181 (1973) doi: 10.1088/0022-3719/6/7/010
[8]	Y. Nambu, Phys. Rev. D 10, 4262 (1974) doi: 10.1103/PhysRevD.10.4262
[9]	B. J. Harrington and H. K. Shepard, Phys. Rev. D 17, 2122 (1978) doi: 10.1103/PhysRevD.17.2122
[10]	H. Fritzsch, Phys. Lett. B 256, 75 (1991) doi: 10.1016/0370-2693(91)90221-B
[11]	T. Schäfer and E. V. Shuryak, Rev. Mod. Phys. 70, 323 (1998), arXiv: hep-ph/9610451[hep-ph] doi: 10.1103/RevModPhys.70.323
[12]	A. Altland and B. D. Simons, Condensed matter field theory (2010)
[13]	H. Sonoda and H. Suzuki, PTEP 2021(2), 023B05 (2021), arXiv: 2012.03568[hep-th] doi: 10.1093/ptep/ptab006
[14]	S. Schaefer et al. (ALPHA), Nucl. Phys. B 845, 93 (2011), arXiv: 1009.5228[hep-lat] doi: 10.1016/j.nuclphysb.2010.11.020
[15]	G. Pan and Z. Y. Meng, doi: 10.1016/B978-0-323-90800-9.00095-0, arXiv: 2204.08777[cond-mat.str-el]
[16]	J. Carrasquilla and R. G. Melko, Nature Phys. 13, 431 (2017), arXiv: 1605.01735[cond-mat.str-el] doi: 10.1038/nphys4035
[17]	G. Carleo and M. Troyer, Science 355(6325), 602 (2017) doi: 10.1126/science.aag2302
[18]	G. Carleo, I. Cirac, K. Cranmer et al., Rev. Mod. Phys. 91(4), 045002 (2019), arXiv: 1903.10563[physics.comp-ph] doi: 10.1103/RevModPhys.91.045002
[19]	K. Zhou, L. Wang, L. G. Pang et al., Prog. Part. Nucl. Phys. 135, 104084 (2024), arXiv: 2303.15136[hep-ph] doi: 10.1016/j.ppnp.2023.104084
[20]	D. Wu, L. Wang, and P. Zhang, Phys. Rev. Lett. 122(8), 080602 (2019) doi: 10.1103/PhysRevLett.122.080602
[21]	O. Sharir, Y. Levine, N. Wies et al., Phys. Rev. Lett. 124(2), 020503 (2020) doi: 10.1103/PhysRevLett.124.020503
[22]	D. Luo, Z. Chen, K. Hu et al., Phys. Rev. Res. 5(1), 013216 (2023) doi: 10.1103/PhysRevResearch.5.013216
[23]	A. Fujita, J. R. Sato, H. M. Garay-Malpartida et al., Modeling gene expression regulatory networks with the sparse vector autoregressive model, BMCsystemsbiology 1 , 1 (2007)
[24]	S. Goldman, J. Li, and C. W. Coley, Generating molecular fragmentation graphs with autoregressive neural networks, AnalyticalChemistry, (2024)
[25]	L. Wang, Y. Jiang, L. He et al., Chin. Phys. Lett. 39(12), 120502 (2022), arXiv: 2005.04857[cond-mat.dis-nn] doi: 10.1088/0256-307X/39/12/120502
[26]	L. Wang, Y. Jiang, and K. Zhou, arXiv: physics.comp-ph/2007.01037
[27]	P. E. Shanahan, A. Trewartha, and W. Detmold, Phys. Rev. D 97(9), 094506 (2018), arXiv: 1801.05784[hep-lat] doi: 10.1103/PhysRevD.97.094506
[28]	Y. T. Song, In Journal of Physics: Conference Series 2649 , 012055 (2023)
[29]	S. Blücher, L. Kades, J. M. Pawlowski et al., Phys. Rev. D 101(9), 094507 (2020), arXiv: 2003.01504[hep-lat] doi: 10.1103/PhysRevD.101.094507
[30]	M. Favoni, A. Ipp, D. I. Müller et al., Phys. Rev. Lett. 128(3), 3 (2022), arXiv: 2012.12901[hep-lat] doi: 10.1103/PhysRevLett.128.032003
[31]	B. Allen, Phys. Rev. D 33, 3640 (1986) doi: 10.1103/PhysRevD.33.3640
[32]	F. Bruckmann, Eur. Phys. J. ST 152, 61 (2007), arXiv: 0706.2269[hep-th] doi: 10.1140/epjst/e2007-00377-2
[33]	M. A. Lopez-Ruiz, T. Yepez-Martinez, A. Szczepaniak et al., Nucl. Phys. A 966, 324 (2017), arXiv: 1605.08017[nucl-th] doi: 10.1016/j.nuclphysa.2017.07.017
[34]	G. 't Hooft, arXiv: hep-th/0010225
[35]	S. Chen, O. Savchuk, S. Zheng et al., Phys. Rev. D 107(5), 056001 (2023), arXiv: 2211.03470[hep-lat] doi: 10.1103/PhysRevD.107.056001
[36]	T. Schäfer, arXiv: hep-lat/0411010
[37]	J. X. Pan and K. T. Fang, Maximum Likelihood Estimation, Growth Eurve Models and Statistical Diagnostics, pages 77-158, 2002
[38]	M. Germain, K. Gregor, I. Murray et al., arXiv: 1502.03509[cs.LG]
[39]	G. Aarts, B. Lucini, and C. Park, Phys. Rev. D 109(3), 034521 (2024), arXiv: 2309.15002[hep-lat] doi: 10.1103/PhysRevD.109.034521
[40]	D. L. Boyda, M. N. Chernodub, N. V. Gerasimeniuk et al., Phys. Rev. D 103(1), 014509 (2021), arXiv: 2009.10971[hep-lat] doi: 10.1103/PhysRevD.103.014509
[41]	A. Palermo, L. Anderlini, M. P. Lombardo et al., PoS LATTICE2021, 030 (2022), arXiv: 2111.05216[hep-lat] doi: 10.22323/1.396.0030
[42]	N. Sale, B. Lucini, and J. Giansiracusa, Phys. Rev. D 107(3), 034501 (2023), arXiv: 2207.13392[hep-lat] doi: 10.1103/PhysRevD.107.034501
[43]	D. Spitz, J. M. Urban, and J. M. Pawlowski, Phys. Rev. D 107(3), 034506 (2023), arXiv: 2208.03955[hep-lat] doi: 10.1103/PhysRevD.107.034506
[44]	D. Diakonov, N. Gromov, V. Petrov et al., Phys. Rev. D 70, 036003 (2004), arXiv: hep-th/0404042[hep-th] doi: 10.1103/PhysRevD.70.036003

[1]	Teng Ma , Jing Shu , Ming-Lei Xiao . Standard model effective field theory from on-shell amplitudes. Chinese Physics C, 2023, 47(2): 023105. doi: 10.1088/1674-1137/aca200
[2]	D. M. Habashy , Mahmoud Y. El-Bakry , Werner Scheinast , Mahmoud Hanafy . Entropy per Rapidity in Pb-Pb Central Collisions using Thermal and Artificial Neural Network (ANN) Models at LHC Energies. Chinese Physics C, 2022, 46(7): 073103. doi: 10.1088/1674-1137/ac5f9d
[3]	Kai-Wen Li , Xiu-Lei Ren , Li-Sheng Geng , Bing-Wei Long . Leading order relativistic hyperon-nucleon interactions in chiral effective field theory. Chinese Physics C, 2018, 42(1): 014105. doi: 10.1088/1674-1137/42/1/014105
[4]	Yan-Ling Li , Yong-Liang Ma , Mannque Rho . Nuclear axial currents from scale-chiral effective field theory. Chinese Physics C, 2018, 42(9): 094102. doi: 10.1088/1674-1137/42/9/094102
[5]	Masoumeh Mohamadian , Hossein Afarideh , Mitra Ghergherehchi . Optimized feed-forward neural-network algorithm trained for cyclotron-cavity modeling. Chinese Physics C, 2017, 41(1): 017003. doi: 10.1088/1674-1137/41/1/017003
[6]	S. Parsamehr , M. Mohsenzadeh . Gauge theory of massless spin-(3/2) field in de Sitter space-time. Chinese Physics C, 2016, 40(11): 113102. doi: 10.1088/1674-1137/40/11/113102
[7]	Cai-Xun Zhang , Shin-Ted Lin , Jian-Ling Zhao , Xun-Zhen Yu , Li Wang , Jing-Jun Zhu , Hao-Yang Xing . Discrimination of neutrons and γ-rays in liquid scintillator based on Elman neural network. Chinese Physics C, 2016, 40(8): 086204. doi: 10.1088/1674-1137/40/8/086204
[8]	LIU Xiao-Xia , LÜ Xiao-Rui , ZHU Yong-Sheng . Combined estimation for multi-measurements of branching ratio. Chinese Physics C, 2015, 39(10): 103001. doi: 10.1088/1674-1137/39/10/103001
[9]	HAN Jie , BAO Jing-Dong . Modified fusion probability by reflection boundary. Chinese Physics C, 2015, 39(5): 054104. doi: 10.1088/1674-1137/39/5/054104
[10]	WANG Dou , Philip Bambade , Kaoru Yokoya , GAO Jie . Analytical estimation of ATF beam halo distribution. Chinese Physics C, 2014, 38(12): 127003. doi: 10.1088/1674-1137/38/12/127003
[11]	YANG Xiao-Yu , XU Tao-Guang , FU Shi-Nian , ZENG Lei , BIAN Xiao-Juan . Classical and modern power spectrum estimation for tune measurement in CSNS RCS. Chinese Physics C, 2013, 37(11): 117003. doi: 10.1088/1674-1137/37/11/117003
[12]	REN Yan-Yu , Efaaf M , ZHANG Wei-Ning . Inhomogeneous space-time structure and two-pion interferometry in NEXSPHERIO model. Chinese Physics C, 2010, 34(4): 472-478. doi: 10.1088/1674-1137/34/4/010
[13]	FAN Guang-Wei , XU Wang , PAN Qiang-Yan , CAI Xiao-Lu , FAN Gong-Tao , LI Yong-Jiang , LUO Wen , XU Ben-Ji , YAN Zhe , YANG Li-Feng . Radius studies of ⁸Li and ⁸B using the optical-limit Glauber model in conjunction with relativistic mean-field theory. Chinese Physics C, 2010, 34(10): 1622-1627. doi: 10.1088/1674-1137/34/10/013
[14]	ZHANG Ying , LIANG Hao-Zhao , MENG Jie . First attempt to overcome the disaster of Dirac sea in imaginary time step method. Chinese Physics C, 2009, 33(S1): 113-115. doi: 10.1088/1674-1137/33/S1/036
[15]	WANG Si-Guang , MAO Ya-Jun , YE Hong-Xue . An artificial neural network for proton identification in HERMES data. Chinese Physics C, 2009, 33(3): 217-223. doi: 10.1088/1674-1137/33/3/011
[16]	ZHANG Kun-Shi , LIU Lian-Shou . On the Identification of Quark and Gluon Jets Using Artificial Neural Network Method. Chinese Physics C, 2004, 28(11): 1141-1145.
[17]	Deng Shenghua , Wang Enke , Li Jiarong . Two-Loop Effective Potential of the Non-topological Soliton Model at Finite Temperature. Chinese Physics C, 1995, 19(S3): 289-304.
[18]	Sun Changpu , Pang Lin , Ge Molin . Effective Topological Action in Heisenberg Spin Model as Berry's Phase. Chinese Physics C, 1992, 16(S1): 51-56.
[19]	Xu Xiaoming , Qiu Xijun . A Chiral Quark-Soliton Model Theory for Nuclear Force. Chinese Physics C, 1990, 14(S4): 381-389.
[20]	Yu Youwen , Shen Pengnian . The Effective Two-Meson-Exchange Potential Derived From the Quark-Antiquark Pair Creation Model. Chinese Physics C, 1989, 13(S2): 175-180.

Access

Figures(4)

Get Citation

Tian Xu, Lingxiao Wang, Lianyi He, Kai Zhou and Yin Jiang. Building imaginary-time thermal field theory with artificial neural networks[J]. Chinese Physics C. doi: 10.1088/1674-1137/ad5f80

Tian Xu, Lingxiao Wang, Lianyi He, Kai Zhou and Yin Jiang. Building imaginary-time thermal field theory with artificial neural networks[J]. Chinese Physics C. doi: 10.1088/1674-1137/ad5f80 shu

RIS(for EndNote,Reference Manager,ProCite)

BibTex

Txt

Milestone

Received: 2024-05-12

Article Metric

Article Views(3949)
PDF Downloads(24)
Cited by(0)

Policy on re-use

To reuse of Open Access content published by CPC, for content published under the terms of the Creative Commons Attribution 3.0 license (“CC CY”), the users don’t need to request permission to copy, distribute and display the final published version of the article and to create derivative works, subject to appropriate attribution.

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

HTML

I. INTRODUCTION

Lattice simulation is an important systematic method to solve strongly correlating and interacting systems in the framework of quantum field theory at finite temperatures [1, 2]. By sampling numerous configurations according to the action, physical observables are computed by ensemble averaging. The background physical mechanism can only be explored by certain integral quantities because of the large amount of configurations and unavoidable fluctuations during the sampling process [3, 4]. To test or realize a certain physical mechanism, different effective models have been developed. Such models are built with specific effective degrees of freedom (d.o.f) and by introducing proper interactions between them [5]. Starting from the fundamental theory, building an effective model is not usually straightforward because of the difficulty of choosing the key d.o.f and including the corresponding interactions. There are several successful examples, such as the Cooper pair for superconductivity, the vortex for the XY model, and various soliton solutions for quantum chromodynamics (QCD) [6−11]. Most of such key d.o.f are obtained by solving semi-classical equations. To complete the effective model, the fluctuations about the chosen d.o.f must be integrated out properly, which is usually a difficult and tedious task.

From the viewpoint of functional integration of quantum field theory, the Lagrangian density is encoded in the distribution of the configuration set [12]. The emerging probability of each configuration is determined by its action. If the correctly distributed ensemble is known, for example by coarse-graining the fundamental lattice configuration into the chosen effective one, the action can be extracted by estimating the probability of the configuration [13], which is an almost impossible task [14, 15] using traditional methods for such a high dimensional distribution. However, popular deep learning frameworks recently proposed are suitable for solving such problems [16−19]. Based on the variational ansatz that decomposes the probability of a lattice configuration into a conditional probability product, a class of autogressive networks has been developed for probability estimation issues [20−22]. For classical systems, such frameworks have already been introduced to study underlying interaction details of condensed matter, chemistry, and biology systems [23−26]. Although some attempts have been reported to learn the action in quantum lattice field theories [27, 29, 30], research on estimations based on external parameters, e.g., temperature dependence, remains notably scarce.

The imaginary-time thermal field theory reformulates quantum statistics by compacting the time direction onto an imaginary-time ring [31]. As a result, the complex weight $ \exp(i S) $ of a configuration becomes a real probability $ \exp(-S) $. If the integral is discretized over imaginary time, it can be found that the temperature dependences of kinetic and potential parts are different but explicit [32, 33]. This implies that following basically the same procedure as that in the classical case can work for quantum ones, except that more than one ensemble is needed to determine the whole phase diagram along the temperature axis. In this paper, we briefly review the classical case to clarify the paradigm of constructing an effective model with artificial neural networks (ANNs). Then, the quantum version is discussed. We show that if the potential part is independent on the imaginary time, only two ensembles at different temperature are enough to determine the action of each configuration if the Lagrangian density is composed by the sum of kinetic and local potential terms. We use an example of quantum mechanics (0+1D field theory) to show the application of the suggested procedure and adopt continuous-mixture autoregressive networks (CANs) to estimate the probability [25]. Numerical experiments demonstrate that to predict the action of one sample at a certain temperature, only two different ensembles suffice. In the interpolation case, i.e., when the predicted temperature falls in the range of two known temperatures, the proposed approach is optimal. This is acceptable because, when the phase structure is approximately characterized at two ends by an effective model, its estimation for intermediate states should not deviate too much from the correct value.

II. NEURAL NETWORKS FOR CLASSICAL STATISTICS

III. NEURAL NETWORKS FOR QUANTUM STATISTICS

V. CONTINUOUS-MIXTURE AUTOREGRESSIVE NETWORK (CAN) TO BUILD ACTIONS

VI. CONCLUSIONS

In this study, we propose a paradigm for constructing an effective model using ANNs once an ensemble of a certain d.o.f has been obtained. Utilizing CANs and a Higgs-like 0+1D quantum field model, we demonstrate the construction process. For this model, there is a topological phase transition from a state dominated by kinks and anti-kinks to a state without kinks as the temperature increases from low to high. Using ensembles generated by the traditional Markov Chain Monte Carlo (MCMC), the CANs successfully extracted the probability of each sample. By utilizing two trained networks at different temperatures, the action of a sample at an arbitrary third temperature was easily determined using Eq. (6). As expected, the predictions were most accurate when interpolating. This approach is beneficial for constructing a new effective model targeting specific d.o.f once the ensembles of the fundamental $d.o.f $ have been established.

This novel paradigm is particularly effective for investigating phase transitions, e.g., deconfinement, distinguishing it from other applications of supervised learning applications [30, 40, 41]. Additionally, it is more user-friendly compared to existing unsupervised methods [42, 43]. For instance, let us consider Quantum Chromodynamics (QCD) at finite temperature. It is suggested that around the critical temperature for deconfinement, a soliton solution of gluons, known as a dyon, dominates the system. With complicated computations, the gluon-quark system governed by QCD can be converted into a dyon-quark ensemble, wherein dyons interact in intricate ways. Although such a dyon ensemble has been obtained analytically after extensive efforts [44], further calculations regarding physical observables still rely on numerical simulation. Consequently, employing the methodology described in this paper will be advantageous for developing an effective model using ANNs.

If the ensemble of gluon configurations can be obtained through lattice simulation, it is possible to first transform the fundamental gluon field into multiple dyons and anti-dyons in space, thereby converting the gluon ensemble into a dyon ensemble. Subsequently, an effective numerical model can be developed by utilizing these dyon ensembles with the methodology described in this paper. As previously suggested, the numerical model must comprise two trained networks. At any given temperature, the action at a third temperature can be calculated. This action can then be utilized as if it were derived from the analytical dyon ensemble model. The same procedure can be applied to any effective d.o.f of various systems.

Reference (44)

Building imaginary-time thermal field theory with artificial neural networks

Abstract：

References

Access

Article Metrics

Metrics

通讯作者: 陈斌, bchen63@163.com

Email This Article

Building imaginary-time thermal field theory with artificial neural networks

Corresponding author: Yin Jiang, jiang_y@buaa.edu.cn

HTML

A. Action estimation

B. Action prediction

目录