Nuclear mass predictions based on a deep neural network and finite-range droplet model (2012)

To Chung Yiu ^1, Haozhao Liang ^{2,3}, Jenny Lee ^1

1. Department of Physics, The University of Hong Kong, Hong Kong 999077, China
2. Department of Physics, Graduate School of Science, The University of Tokyo, Tokyo 113-0033, Japan
3. Interdisciplinary Theoretical and Mathematical Sciences Program (iTHEMS), RIKEN, Wako 351-0198, Japan

Chinese Physics C, 2024, Vol. 48, Issue 2: 024102. doi: 10.1088/1674-1137/ad021c

Abstract:
A neural network with two hidden layers is developed for nuclear mass prediction, based on the finite-range droplet model (FRDM12). Different hyperparameters, including the number of hidden units, choice of activation functions, initializers, and learning rates, are adjusted explicitly and systematically. The final mass predictions are obtained by averaging the predictions given by several different sets of hyperparameters with different regularizers and seed numbers. This provides not only the average values of the mass predictions but also reliable estimates of their uncertainties. The overall root-mean-square deviations of nuclear mass are reduced from 0.603 MeV for the FRDM12 model to 0.200 MeV and 0.232 MeV for the training and validation sets, respectively.

Keywords: nuclear mass, machine learning, deep neural network

References

[1]	D. Lunney, J. M. Pearson, and C. Thibault, Reviews of Modern Physics 75, 1021 (2003) doi: 10.1103/RevModPhys.75.1021
[2]	W. Huang, M. Wang, F. Kondev et al., Chin. Phys. C 45, 030002 (2021) doi: 10.1088/1674-1137/abddb0
[3]	M. Mumpower, R. Surman, G. McLaughlin et al., Progress in Particle and Nuclear Physics 86, 86 (2016) doi: 10.1016/j.ppnp.2015.09.001
[4]	E. M. Burbidge, G. R. Burbidge, W. A. Fowler et al., Reviews of Modern Physics 29, 547 (1957) doi: 10.1103/RevModPhys.29.547
[5]	D. Martin, A. Arcones, W. Nazarewicz et al., Phys. Rev. Lett. 116, 121101 (2016) doi: 10.1103/PhysRevLett.116.121101
[6]	P. Möller, A. Sierk, T. Ichikawa et al., Atomic Data and Nuclear Data Tables 109-110, 1 (2016) doi: 10.1016/j.adt.2015.10.002
[7]	S. Goriely and N. Chamel, Phys. Rev. Lett. 102, 152503 (2009) doi: 10.1103/PhysRevLett.102.152503
[8]	S. Goriely and N. Chamel, Phys. Rev. C 88, 024308 (2013) doi: 10.1103/PhysRevC.88.024308
[9]	P. W. Zhao, Z. P. Li, J. M. Yao et al., Phys. Rev. C 82, 054319 (2010) doi: 10.1103/PhysRevC.82.054319
[10]	N. Wang, M. Liu, X. Wu et al., Phys. Lett. B 734, 215 (2014) doi: 10.1016/j.physletb.2014.05.049
[11]	Z. Niu and H. Liang, Phys. Lett. B 778, 48 (2018) doi: 10.1016/j.physletb.2018.01.002
[12]	R. Pederson, B. Kalita, and K. Burke, Nature Reviews Physics 4, 357 (2022) doi: 10.1038/s42254-022-00470-2
[13]	H. J. Kulik, T. Hammerschmidt, J. Schmidt et al., Electronic Structure 4, 023004 (2022) doi: 10.1088/2516-1075/ac572f
[14]	A. Boehnlein, M. Diefenthaler, N. Sato et al., Reviews of Modern Physics 94, 031003 (2022) doi: 10.1103/RevModPhys.94.031003
[15]	Z. M. Niu, H. Z. Liang, B. H. Sun et al., Phys. Rev. C 99, 064307 (2019) doi: 10.1103/PhysRevC.99.064307
[16]	F. Minato, Z. Niu, and H. Liang, Phys. Rev. C 106, 024306 (2022) doi: 10.1103/PhysRevC.106.024306
[17]	C. Ma, Z. Li, Z. M. Niu et al., Phys. Rev. C 100, 024330 (2019) doi: 10.1103/PhysRevC.100.024330
[18]	D. Wu, C. L. Bai, H. Sagawa et al., Phys. Rev. C 102, 054323 (2020) doi: 10.1103/PhysRevC.102.054323
[19]	R.-D. Lasseri, D. Regnier, J.-P. Ebran et al., Phys. Rev. Lett. 124, 162502 (2020) doi: 10.1103/PhysRevLett.124.162502
[20]	R. Utama and J. Piekarewicz, Phys. Rev. C 96, 044308 (2017) doi: 10.1103/PhysRevC.96.044308
[21]	E. Yüksel, D. Soydaner, and H. Bahtiyar, International Journal of Modern Physics E: Nuclear Physics 30, 2150017 (2021) doi: 10.1142/S0218301321500178
[22]	Z. M. Niu and H. Z. Liang, Phys. Rev. C 106, L021303 (2022) doi: 10.1103/PhysRevC.106.L021303
[23]	J. W. Clark and H. Li, International Journal of Modern Physics B 20, 5015 (2006) doi: 10.1142/S0217979206036053
[24]	M. Shelley and A. Pastore, Universe 7, 131 (2021)
[25]	Z.-P. Gao, Y.-J. Wang, H.-L. Lü et al., Nuclear Science and Techniques 32 (2021) doi: 10.1007/s41365-021-00956-1
[26]	Z. M. Niu, Z. L. Zhu, Y. F. Niu et al., Phys. Rev. C 88, 024325 (2013) doi: 10.1103/PhysRevC.88.024325
[27]	X. Wu, L. Guo, and P. Zhao, Phys. Lett. B 819, 136387 (2021) doi: 10.1016/j.physletb.2021.136387
[28]	Y. Liu, C. Su, J. Liu et al., Phys. Rev. C 104, 014315 (2021) doi: 10.1103/PhysRevC.104.014315
[29]	D. Wu, C. L. Bai, H. Sagawa et al., Phys. Rev. C 104, 054303 (2021) doi: 10.1103/PhysRevC.104.054303
[30]	TensorFlow, Module: tf.keras.activations, https://www.tensorflow.org/api_docs/python/tf/keras/activations (2023), accessed: 2023-06-01.
[31]	M. W. Kirson, Nucl. Phys. A 798, 29 (2008) doi: 10.1016/j.nuclphysa.2007.10.011
[32]	TensorFlow, tf.keras.optimizers.Adam, https://www.tensorflow.org/api_docs/python/tf/keras/optimizers/Adam (2023), accessed: 2023-06-01.
[33]	TensorFlow, Module: tf.keras.initializers, https://www.tensorflow.org/api_docs/python/tf/keras/initializers (2023), accessed: 2023-06-01.
[34]	TensorFlow, Module: tf.keras.regularizers, https://www.tensorflow.org/api_docs/python/tf/keras/regularizers (2023), accessed: 2023-06-01.


Received: 2023-06-07

I. INTRODUCTION

Nuclear mass is one of the most fundamental nuclear properties [1, 2]. It not only reflects the static properties of nuclei but also determines the reaction energies in different nuclear processes, such as β decay, neutron capture, and fission [3]. All these processes play important roles in the origin and abundance of the elements in the universe [3−5].

Currently, experimental masses are available for approximately 2500 nuclei, which is only about 27.8% of the roughly 9000 nuclei predicted to be bound [2, 6]. To obtain masses for the remaining nuclei, theoretical mass models are required. Several such models exist. Some are microscopic, such as the Hartree–Fock–Bogoliubov (HFB) [7, 8] and relativistic mean-field (RMF) [9] mass models, whereas others are macroscopic-microscopic, such as the finite-range droplet model [6] and the Weizsäcker–Skyrme (WS) model [10]. These models give root-mean-square (RMS) deviations of 0.3 to 2.3 MeV between the theoretical and experimental masses [11]. Reducing the RMS deviation is critical for a better description of the final element abundances [3, 4], and a machine learning model offers one way to do so.
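As a minimal sketch of the RMS deviation used as the figure of merit here, assuming the standard definition $\sigma = \sqrt{\sum_i (M_i^{\rm th} - M_i^{\rm exp})^2 / N}$: the function name and the toy masses below are hypothetical and are not data from this work.

```python
import math


def rms_deviation(predicted, measured):
    """Root-mean-square deviation between two equal-length mass lists (MeV)."""
    if len(predicted) != len(measured) or not predicted:
        raise ValueError("need two non-empty sequences of equal length")
    squared = [(p - m) ** 2 for p, m in zip(predicted, measured)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical toy values, chosen only to make the arithmetic easy to follow.
toy_theory = [10.0, 20.5, 29.0]
toy_experiment = [10.5, 20.0, 30.0]

# Squared differences are 0.25, 0.25 and 1.0, so sigma = sqrt(1.5 / 3).
sigma = rms_deviation(toy_theory, toy_experiment)
print(f"sigma = {sigma:.3f} MeV")
```

The same function can be evaluated separately on training and validation nuclei, which is how the two deviations quoted in this paper are distinguished.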

Neural networks, a class of machine learning algorithms, have been widely used in different research fields [12−14]. Several recent studies have shown that neural networks can improve the accuracy of models for various nuclear properties, such as β-decay half-lives [15, 16], neutron capture rates [17], nuclear charge radii [18], and ground-state and excited-state energies [19].

Neural networks have also been used in nuclear mass prediction in several previous studies [11,20−22]. More broadly, machine learning methods including artificial neural networks (ANNs) [21], support vector machines (SVMs) [23], Bayesian neural networks (BNNs) [11], Bayesian machine learning (BML) [22], the Gaussian process [24], the light gradient-boosting machine (LightGBM) [25], the radial basis function (RBF/RBF-oe) [26], kernel ridge regression (KRR/KRR-oe) [27], and the naive Bayesian probability classifier (NBP) [28] have been applied to mass prediction. The ANN algorithm [21] was used to determine the effects of different numbers of hidden layers. The effects of different numbers of inputs were investigated using ANNs [21] and BNNs [11]. An SVM [23] was used to investigate the effects of the training, validation, and test set ratio. The BML model [22] combined nine different BNNs to predict the nuclear mass. The Gaussian process [24] was found to reduce the RMS deviation of the nuclear mass to less than 200 keV and to help flatten out the discrepancies of nuclear mass in exotic regions. The LightGBM model [25], with 10 input features, was used to show the effects of different ratios of training and test sets. The RBF/RBF-oe approach [26] exploited the correlation between the target nuclei and their surrounding nuclei, the KRR/KRR-oe approach [27] was used to account for the odd-even effects of the nuclei, and the NBP model [28] was used to calculate the nuclear mass after classifying the residuals of the nuclei into different groups.

The machine learning methods listed above have different advantages. The Bayesian methods, including BNNs [11], BML [22], and the NBP [28], include probability distributions in the parameters; thus, they can avoid the over-fitting problem and provide a probability distribution for the mass prediction in unknown regions. The Gaussian process [24] is similar to the previous methods; however, it includes probability distributions over the sample functions and also gives a probability distribution for the mass prediction. SVMs [23] can determine the number of hidden units automatically so that the number of hyperparameters can be reduced. The LightGBM [25] can accelerate the training process, reduce the computational time, and allow us to check the importance of different inputs of the model toward the outputs. The RBF/RBF-oe approach [26] can predict the mass based on the distance from the target nuclei to the training set. The KRR/KRR-oe approach [27] can identify the limit of the extrapolation distance of the model automatically so that the worst nuclear mass predictions far from the experimental results can be avoided.

Meanwhile, each neural network in these studies included several types of hyperparameters, such as the number of hidden units, choice of activation functions, initializers, and learning rates [16, 29]. These hyperparameters play very important roles in both the training process and the final performance of the neural network. It is therefore essential to investigate such hyperparameters in an explicit and systematic way.
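One way to make such a scan explicit is a one-factor-at-a-time sweep, in which each hyperparameter is varied in turn while the others stay at a baseline. The sketch below is only illustrative: the baseline, the candidate values, and the helper name are assumptions and are not the settings used in this study.

```python
# Hypothetical baseline configuration (not taken from the paper).
baseline = {
    "hidden_units": 32,
    "activation": "tanh",
    "initializer": "glorot_uniform",
    "learning_rate": 1e-3,
}

# Hypothetical candidate values for each hyperparameter named in the text.
candidates = {
    "hidden_units": [16, 32, 64],
    "activation": ["relu", "tanh", "sigmoid"],
    "initializer": ["glorot_uniform", "he_normal"],
    "learning_rate": [1e-2, 1e-3, 1e-4],
}


def one_at_a_time(base, options):
    """Return (varied_name, config) pairs, changing one hyperparameter per run."""
    runs = []
    for name, values in options.items():
        for value in values:
            config = dict(base)  # copy so the baseline is never mutated
            config[name] = value
            runs.append((name, config))
    return runs


runs = one_at_a_time(baseline, candidates)
for name, config in runs:
    print(name, config)
```

Each resulting configuration would then be trained and scored by its training and validation RMS deviations; a full grid over all combinations is an alternative, at a much higher training cost.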

In this study, a deep neural network (DNN) is used to construct a mass model that improves upon the current finite-range droplet model [6]. Different hyperparameters are investigated in detail to obtain a better neural network model. Moreover, several different sets of hyperparameters are used together to predict the nuclear masses and to estimate the uncertainty of the result.

The details of the neural network used in this study are given in Sec. II. The results of the nuclear mass neural network model and its performance are discussed in Sec. III. Finally, a summary is presented in Sec. IV. All the hyperparameters adjusted in this study are given in Appendices A, B, C, and D.

IV. SUMMARY AND PERSPECTIVES

A DNN is applied to nuclear mass prediction on the basis of the FRDM12 model. Different hyperparameters, including activation functions, the learning rate, the number of hidden units, and initializers, are adjusted in a systematic way to achieve better model performance, as shown in Appendices A, B, C, and D. Finally, seven sets of hyperparameters with different regularizers and seed numbers were selected. Importantly, averaging the predictions given by these different sets provides not only the average values of the mass predictions but also reliable estimates of their uncertainties.
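The averaging step can be sketched as follows. This is a minimal illustration that assumes the uncertainty is taken as the sample standard deviation across the ensemble members; the text does not spell out the estimator, and the function name and toy numbers are hypothetical.

```python
from statistics import mean, stdev


def ensemble_summary(member_predictions):
    """Combine per-model predictions into a mean and a spread per nucleus.

    member_predictions: one list per trained model, all in the same nucleus order.
    """
    if len(member_predictions) < 2:
        raise ValueError("need at least two ensemble members for a spread")
    per_nucleus = list(zip(*member_predictions))  # regroup values by nucleus
    means = [mean(values) for values in per_nucleus]
    spreads = [stdev(values) for values in per_nucleus]  # sample standard deviation
    return means, spreads


# Hypothetical predictions (MeV) from three models for two nuclei.
toy_members = [
    [1.0, 2.0],
    [1.2, 2.0],
    [0.8, 2.0],
]
means, spreads = ensemble_summary(toy_members)
print(means, spreads)
```

In this toy case the models disagree on the first nucleus and agree on the second, so only the first receives a nonzero spread.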

With the neural network, the RMS deviations between the experimental and theoretical masses are reduced from 0.603 MeV to 0.200 MeV and 0.232 MeV for the training and validation sets, respectively. For most of the nuclei, this DNN model gives a better mass prediction than the FRDM12 model. Even for nuclei whose masses are poorly predicted by the FRDM12 model, such as $^{41}$Cr and $^{42}$Cr, the DNN model still reduces the deviation and achieves a better mass prediction. However, several nuclei still have a worse mass prediction than in the FRDM12 model, which may occur in the validation set and be related to unresolved odd-even staggering. Further studies on these nuclei, for example a better description of the odd-even staggering, are required to improve the model.

In the future, the same technique can be applied to other physical quantities of nuclei. Using the nuclear masses predicted in this study and adjusting the hyperparameters, DNN models for quantities such as β-decay half-lives and β-delayed neutron emission probabilities can be built, and their performance can be compared with that of current theoretical models to determine whether this approach improves on existing predictions.

APPENDIX A: PERFORMANCE WITH DIFFERENT ACTIVATION FUNCTIONS

APPENDIX B: PERFORMANCE WITH DIFFERENT LEARNING RATES

APPENDIX C: PERFORMANCE WITH DIFFERENT NUMBERS OF HIDDEN UNITS

APPENDIX D: PERFORMANCE WITH DIFFERENT INITIALIZERS


Hyperparameters	$\sigma_{\rm training}$ (MeV)	$\sigma_{\rm validation}$ (MeV)	$\Delta\sigma_{\rm training}$ (%)	$\Delta\sigma_{\rm validation}$ (%)
No regularizer, seed $=3$	0.175	0.228	71.0	62.2
L2 regularizer, $\lambda=0.00001$, seed $=1$	0.204	0.226	66.2	62.5
L2 regularizer, $\lambda=0.00001$, seed $=2$	0.216	0.235	64.2	61.0
Orthogonal regularizer, $\lambda=0.1$, seed $=2$	0.212	0.218	64.8	63.8
Orthogonal regularizer, $\lambda=0.01$, seed $=1$	0.176	0.238	70.8	60.5
Orthogonal regularizer, $\lambda=0.001$, seed $=1$	0.219	0.238	63.7	60.5
Orthogonal regularizer, $\lambda=0.001$, seed $=2$	0.201	0.238	66.7	60.5
Average of the above sets	0.200	0.232	66.8	61.5
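The percentage columns are consistent with a relative reduction measured against the FRDM12 deviation of 0.603 MeV, and the last row is consistent with an arithmetic mean of the seven deviations above it. The short check below reproduces both from the tabulated values; the variable and function names are illustrative, and reading $\Delta\sigma$ as $(0.603 - \sigma)/0.603$ is an inference from the numbers rather than a definition quoted from the text.

```python
FRDM12_SIGMA = 0.603  # MeV, RMS deviation of the FRDM12 model quoted in the text

# (training sigma, validation sigma) in MeV, copied from the seven table rows.
table_rows = [
    (0.175, 0.228),
    (0.204, 0.226),
    (0.216, 0.235),
    (0.212, 0.218),
    (0.176, 0.238),
    (0.219, 0.238),
    (0.201, 0.238),
]


def reduction_percent(sigma, baseline=FRDM12_SIGMA):
    """Relative reduction of the RMS deviation with respect to the baseline, in %."""
    return round(100.0 * (baseline - sigma) / baseline, 1)


reductions = [(reduction_percent(t), reduction_percent(v)) for t, v in table_rows]

# Arithmetic means of the seven deviations, rounded as in the table.
mean_training = round(sum(t for t, _ in table_rows) / len(table_rows), 3)
mean_validation = round(sum(v for _, v in table_rows) / len(table_rows), 3)

print(reductions)
print(mean_training, mean_validation)
print(reduction_percent(mean_training), reduction_percent(mean_validation))
```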
