Nuclear mass predictions based on a deep neural network and finite-range droplet model (2012)

To Chung Yiu; Haozhao Liang; Jenny Lee

doi:10.1088/1674-1137/ad021c

Chinese Physics C> 2024, Vol. 48> Issue(2) : 024102 DOI: 10.1088/1674-1137/ad021c

Nuclear mass predictions based on a deep neural network and finite-range droplet model (2012)

To Chung Yiu ^1, ,
Haozhao Liang ^{2,3,
,} ,
Jenny Lee ¹

1.
Department of Physics, The University of Hong Kong, Hong Kong 999077, China
2.
Department of Physics, Graduate School of Science, The University of Tokyo, Tokyo 113-0033, Japan
3.
Interdisciplinary Theoretical and Mathematical Sciences Program (iTHEMS), RIKEN, Wako 351-0198, Japan

Abstract
HTML
Reference
Related

PDF

Abstract：
A neural network with two hidden layers is developed for nuclear mass prediction, based on the finite-range droplet model (FRDM12). Different hyperparameters, including the number of hidden units, choice of activation functions, initializers, and learning rates, are adjusted explicitly and systematically. The resulting mass predictions are achieved by averaging the predictions given by several different sets of hyperparameters with different regularizers and seed numbers. This can provide not only the average values of mass predictions but also reliable estimations in the mass prediction uncertainties. The overall root-mean-square deviations of nuclear mass are reduced from 0.603 MeV for the FRDM12 model to 0.200 MeV and 0.232 MeV for the training and validation sets, respectively.
- nuclear mass ,
- machine learning ,
- deep neural network

References

[1]	D. Lunney, J. M. Pearson, and C. Thibault, Reviews of Modern Physics 75, 1021 (2003) doi: 10.1103/RevModPhys.75.1021
[2]	W. Huang, M. Wang, F. Kondev et al., Chin. Phys. C 45, 030002 (2021) doi: 10.1088/1674-1137/abddb0
[3]	M. Mumpower, R. Surman, G. McLaughlin et al., Progress in Particle and Nuclear Physics 86, 86 (2016) doi: 10.1016/j.ppnp.2015.09.001
[4]	E. M. Burbidge, G. R. Burbidge, W. A. Fowler et al., Reviews of Modern Physics 29, 547 (1957) doi: 10.1103/RevModPhys.29.547
[5]	D. Martin, A. Arcones, W. Nazarewicz et al., Phys.l Rev. Lett. 116, 121101 (2016) doi: 10.1103/PhysRevLett.116.121101
[6]	P. Möller, A. Sierk, T. Ichikawa et al., Atomic Data and Nuclear Data Tables 109- 109-110, 1 (2016) doi: 10.1016/j.adt.2015.10.002
[7]	S. Goriely and N. Chamel, Phys. Rev. Lett. 102, 152503 (2009) doi: 10.1103/PhysRevLett.102.152503
[8]	S. Goriely and N. Chamel, Phys. Rev. C 88, 024308 (2013) doi: 10.1103/PhysRevC.88.024308
[9]	P. W. Zhao, Z. P. Li, J. M. Yao et al., Phys. Rev. C 82, 054319 (2010) doi: 10.1103/PhysRevC.82.054319
[10]	N. Wang, M. Liu, X. Wu et al., Phys. Lett. B 734, 215 (2014) doi: 10.1016/j.physletb.2014.05.049
[11]	Z. Niu and H. Liang, Phys. Lett. B 778, 48 (2018) doi: 10.1016/j.physletb.2018.01.002
[12]	R. Pederson, B. Kalita, and K. Burke, Nature Reviews Physics 4, 357 (2022) doi: 10.1038/s42254-022-00470-2
[13]	H. J. Kulik, T. Hammerschmidt, J. Schmidt et al., Electronic Structure 4, 023004 (2022) doi: 10.1088/2516-1075/ac572f
[14]	A. Boehnlein, M. Diefenthaler, N. Sato, M. Schram et al., Reviews of Modern Physics 94, 031003 (2022) doi: 10.1103/RevModPhys.94.031003
[15]	Z. M. Niu, H. Z. Liang, B. H. Sun et al., Phys. Rev. C 99, 064307 (2019) doi: 10.1103/PhysRevC.99.064307
[16]	F. Minato, Z. Niu, and H. Liang, Phys. Rev. C 106, 024306 (2022) doi: 10.1103/PhysRevC.106.024306
[17]	C. Ma, Z. Li, Z. M. Niu et al., Phys. Rev. C 100, 024330 (2019) doi: 10.1103/PhysRevC.100.024330
[18]	D. Wu, C. L. Bai, H. Sagawa et al., Phys. Rev. C 102, 054323 (2020) doi: 10.1103/PhysRevC.102.054323
[19]	R.-D. Lasseri, D. Regnier, J.-P. Ebran et al., Phys. Rev. Lett. 124, 162502 (2020) doi: 10.1103/PhysRevLett.124.162502
[20]	R. Utama and J. Piekarewicz, Phys. Rev. C 96, 044308 (2017) doi: 10.1103/PhysRevC.96.044308
[21]	E. Yüksel, D. Soydaner, and H. Bahtiyar, International Journal of Modern Physics E: Nuclear Physics 30, 2150017 (2021) doi: 10.1142/S0218301321500178
[22]	Z. M. Niu and H. Z. Liang, Phys. Rev. C 106, L021303 (2022) doi: 10.1103/PhysRevC.106.L021303
[23]	J. W. CLARK and H. LI, International Journal of Modern Physics B 20, 5015 (2006) doi: 10.1142/S0217979206036053
[24]	M. Shelley and A. Pastore, A new mass model for nuclear astrophysics: Crossing 200 keV accuracy, Universe 7, 131 (2021)
[25]	Z.-P. Gao, Y.-J. Wang, H.-L. Lü et al., Machine learning the nuclear mass, Nuclear Science and Techniques 32 , https://doi.org/10.1007/s41365-021-00956-1 (2021). doi: 10.1007/s41365-021-00956-1
[26]	Z. M. Niu, Z. L. Zhu, Y. F. Niu et al., Phys. Rev. C 88, 024325 (2013) doi: 10.1103/PhysRevC.88.024325
[27]	X. Wu, L. Guo, and P. Zhao, Phys. Lett. B 819, 136387 (2021) doi: 10.1016/j.physletb.2021.136387
[28]	Y. Liu, C. Su, J. Liu et al., Phys. Rev. C 104, 014315 (2021) doi: 10.1103/PhysRevC.104.014315
[29]	D. Wu, C. L. Bai, H. Sagawa et al, Phys. Rev. C 104, 054303 (2021) doi: 10.1103/PhysRevC.104.054303
[30]	TensorFlow, Module: tf.keras.activations, https://www.tensorflow.org/api_docs/python/tf/keras/activations (2023), accessed: 2023-06-01.
[31]	M. W. Kirson, Nucl. Phys. A 798, 29 (2008) doi: 10.1016/j.nuclphysa.2007.10.011
[32]	TensorFlow, tf.keras.optimizers.adam, https://www.tensorflow.org/api_docs/python/tf/keras/optimizers/Adam (2023), accessed: 2023-06-01.
[33]	TensorFlow, Module: tf.keras.initializers, https://www.tensorflow.org/api_docs/python/tf/keras/initializers (2023), accessed: 2023-06-01.
[34]	TensorFlow, Module: tf.keras.regularizers, https://www.tensorflow.org/api_docs/python/tf/keras/regularizers (2023), accessed: 2023-06-01.

[1]	Yu Jia , Jichen Pan , Jia-Yue Zhang . Soft pattern of gravitational Rutherford scattering from heavy target mass expansion. Chinese Physics C, 2026, 50(1): 013102. doi: 10.1088/1674-1137/ad62d6
[2]	Ali Çiçi , Hüseyin Dağ . Resolving the W boson Mass in the Lepton Specific Two Higgs Doublet Model. Chinese Physics C, 2026, 50(3): 1-19.
[3]	Xiao-Yu Xu , X. Q. Qi , L. Deng , A. X. Chen , H. K. Wang , Y. B. Qian . Uncertainty analysis of the nuclear liquid drop model. Chinese Physics C, 2026, 50(3): 1-9. doi: 10.1088/1674-1137/ae1444
[4]	Tao-Feng Wang , Zi-Ming Li , Xiao-Ting Yang , Min-Liang Liu , Jian-Song Wang , Yan-Yun Yang , Zhi-Yu Sun , Cheng-Jian Lin , Qing-Hua He , Zhen Bai , Fang-Fang Duan , Zhi-Hao Gao , Song Guo , Yue Hu , Wei Jiang , F. Kobayashi , Chen-Gui Lu , Jun-Bing Ma , Peng Ma , Jian-Guo Wang , Xiang-Lun Wei , He-Run Yang , Yong-Jin Yao , Jun-Wei Zhang . Nuclear Medium Effects for Modifying Tensor Interaction. Chinese Physics C, 2026, 50(3): 1-6. doi: 10.1088/1674-1137/ae167d
[5]	Li Tang , Liang Liu , Ying Wu . Null test of cosmic curvature using deep learning method. Chinese Physics C, 2026, 50(1): 015107. doi: 10.1088/1674-1137/ae0b42
[6]	Yi Wei Hao , Yi Fei Niu , Zhong Ming Niu . The role of fission in mass sensitivity study of the r-process. Chinese Physics C, 2026, 50(1): 014106. doi: 10.1088/1674-1137/adfe55
[7]	Zhi Long Li , Bing Feng Lv , Yong Jia Wang , C. M. Petrache . Study of yrast and yrare low-lying excited states using machine learning approaches. Chinese Physics C, 2026, 50(1): 014107. doi: 10.1088/1674-1137/adfe54
[8]	Liang Liu , Hai-Nan Lin , Li Tang . Revised classification of the CHIME fast radio bursts with machine learning. Chinese Physics C, 2026, 50(1): 015102. doi: 10.1088/1674-1137/ae0725
[9]	Yongcheng Wu , Liang Xiao , Yan Zhang . Deep Learning to Improve the Sensitivity of Higgs Pair Searches in the 4b Channel at the LHC. Chinese Physics C, 2026, 50(3): 1-15.
[10]	Xiaoxuan Lin , Wei Kou , Shixin Fu , Rong Wang , Chengdong Han , Xurong Chen . Revisiting the deuteron mass radius via near-threshold ρ⁰, ω, and ϕ meson photoproduction. Chinese Physics C, 2025, 49(10): 103105. doi: 10.1088/1674-1137/ade1c9
[11]	Hai-Jun Li , Yu-Feng Zhou . Mass mixing between QCD axions. Chinese Physics C, 2025, 49(11): 115101. doi: 10.1088/1674-1137/aded01
[12]	Qing Wu , Wei-Feng Li , Zhong-Ming Niu , Hao-Zhao Liang , Min Shi . Erratum: Improvement of nuclear semi-empirical mass formula by including shell effect (Chin. Phys. C, 49(11): 114103 (2025)). Chinese Physics C, 2025, 49(12): 129001. doi: 10.1088/1674-1137/ae23a6
[13]	Mudassar Ahmed , Abdul Kabir , Jameel-Un Nabi , Laiba Hamid , Manzoor Ahmad . Bayesian-optimized CatBoost for Ground-State Nuclear Charge-Radius Prediction. Chinese Physics C, 2025, 50(3): 1-13.
[14]	Qing Wu , Wei-Feng Li , Zhong-Ming Niu , Hao-Zhao Liang , Min Shi . Improvement of nuclear semi-empirical mass formula by including shell effect. Chinese Physics C, 2025, 49(11): 114103. doi: 10.1088/1674-1137/ade954
[15]	Jialei Wei , Ao Liu , Dejiang Li , Cuihong Wen . Physical parameter regression from black hole images using a multiscale adaptive neural network. Chinese Physics C, 2025, 49(12): 125105. doi: 10.1088/1674-1137/adf542
[16]	Shuang Qu , Jin-Yan Zhang , Man Bao . Nuclear mass predictions with a Bayesian neural network. Chinese Physics C, 2025, 49(10): 104106. doi: 10.1088/1674-1137/ade958
[17]	Peng Yin , Jianmin Dong , Wei Zuo . Effect of tensor correlations on the depletion of nuclear Fermi sea within the extended BHF approach. Chinese Physics C, 2017, 41(11): 114102. doi: 10.1088/1674-1137/41/11/114102
[18]	LIANG Jun , LIU Yan-Chun , ZHU Qiao . Thermodynamics of noncommutative geometry inspired black holes based on Maxwell-Boltzmann smeared mass distribution. Chinese Physics C, 2014, 38(2): 025101. doi: 10.1088/1674-1137/38/2/025101
[19]	M. Bashkanov (for the WASA-at-COSY collaboration) . ABC effect in double-pionic nuclear fusion and a pn resonance as its possible origin. Chinese Physics C, 2010, 34(9): 1339-1341. doi: 10.1088/1674-1137/34/9/037
[20]	YE Yan-Lin , ZHOU Xiao-Hong , LIU Wei-Ping , MA Yu-Gang , ZHANG Yu-Hu , XU Fu-Rong . Outline of the progress in the study of radioactive nuclear physics and super-heavy nuclei. Chinese Physics C, 2008, 32(S2): 1-7.

Access

Figures(12) / Tables(1)

Get Citation

To Chung Yiu, Haozhao Liang and Jenny Lee. Nuclear mass predictions based on deep neural network and finite-range droplet model (2012)[J]. Chinese Physics C. doi: 10.1088/1674-1137/ad021c

To Chung Yiu, Haozhao Liang and Jenny Lee. Nuclear mass predictions based on deep neural network and finite-range droplet model (2012)[J]. Chinese Physics C. doi: 10.1088/1674-1137/ad021c shu

RIS(for EndNote,Reference Manager,ProCite)

BibTex

Txt

Milestone

Received: 2023-06-07

Article Metric

Article Views(2816)
PDF Downloads(44)
Cited by(0)

Policy on re-use

To reuse of subscription content published by CPC, the users need to request permission from CPC, unless the content was published under an Open Access license which automatically permits that type of reuse.

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

HTML

I. INTRODUCTION

Nuclear mass is one of the most fundamental nuclear properties [1, 2]. It not only represents the static properties of nuclei but also determines the reaction energies in different nuclear processes, such as β decay, neutron capture, and fission [3]. All these processes play important roles in the origin of elements and the abundance of elements in the universe [3−5].

Currently, there are approximately 2500 nuclei with experimental masses. These nuclei are estimated to be only 27.8% of around 9000 theoretically estimated bounded nuclei [2, 6]. To gain a better understanding of nuclear mass for all nuclei, theoretical mass models are required. There are several theoretical models for mass prediction. Some of them are microscopic models, such as Hartree–Fock–Bogoliubov (HFB) [7, 8] and relativistic mean-field (RMF) mass models [9], whereas others are macroscopic-microscopic models, such as the finite-range droplet [6] and Weizsäcker–Skyrme (WS) models [10]. These theoretical models give a root-mean-square (RMS) deviation of 0.3 to 2.3 MeV between the theoretical and experimental masses [11]. To reduce RMS deviation, which is critical for a better description of final element abundance [3, 4], a machine learning model can be developed.

Neural networks, a type of algorithm in machine learning, have been widely used in different research fields [12−14]. Several recent studies have proven that neural networks are able to improve the accuracy of models for several different nuclear properties, such as the β-decay half-lives [15, 16], neutron capture rate [17], nuclear charge radii [18], and ground-state and excited energies [19].

Neural networks have also been used in nuclear mass prediction in several previous studies [11,20−22]. For example, artificial neural networks (ANNs) [21], support vector machines (SVMs) [23], Bayesian neural networks (BNNs) [11], Bayesian machine learning (BML) [22], Gaussian process [24], light gradient-boosting machine (LightGBM) [25], the radial basis function (RBF/RBF-oe) [26], kernel ridge regression (KRR/KRR-oe) [27], and naive Bayesian probability classifier (NBP) [28] have been used in mass prediction. The ANN algorithm [21] was used to determine the effects of different numbers of hidden layers. The effects of different numbers of inputs were investigated using ANNs [21] and BNNs [11]. An SVM [23] was used to investigate the effects of the training, validation, and test set ratio. The BML model [22] was used with nine different BNNs in total to predict the nuclear mass. The Gaussian process [24] was found to reduce the RMS deviation of the nuclear mass to less than $ 200 $ keV and help flatten out the discrepancies of nuclear mass in exotic regions. The LightGBM model [25] including $ 10 $ input features was used to show the effects of different ratios of training and test sets. The RBF/RBF-oe approach [26] showed the correlation between the target nuclei and their surrounding nuclei, the KRR/KRR-oe approach [27] has been used on the odd-even effects of the nuclei, and the NBP model [28] was used to calculate the nuclear mass after classifying the residuals of the nuclei into different groups.

The machine learning methods listed above have different advantages. The Bayesian methods, including BNNs [11], BML [22], and the NBP [28], include probability distributions in the parameters; thus, they can avoid the over-fitting problem and provide a probability distribution for the mass prediction in unknown regions. The Gaussian process [24] is similar to the previous methods; however, it includes probability distributions over the sample functions and also gives a probability distribution for the mass prediction. SVMs [23] can determine the number of hidden units automatically so that the number of hyperparameters can be reduced. The LightGBM [25] can accelerate the training process, reduce the computational time, and allow us to check the importance of different inputs of the model toward the outputs. The RBF/RBF-oe approach [26] can predict the mass based on the distance from the target nuclei to the training set. The KRR/KRR-oe approach [27] can identify the limit of the extrapolation distance of the model automatically so that the worst nuclear mass predictions far from the experimental results can be avoided.

Meanwhile, each neural network in these studies included several types of hyperparameters, such as the number of hidden units, choice of activation functions, initializers, and learning rates [16, 29]. These hyperparameters play very important roles in both the training process and final performance of the neural network. In other words, it is essential to investigate such hyperparameters in an explicit and systematic way.

In this study, a deep neural network (DNN) is used to construct a mass model that improves upon the current finite-range droplet model [6]. Different hyperparameters are particularly investigated to achieve a better neural network model. Moreover, several different sets of hyperparameters are used together to predict the nuclear masses and provide the uncertainty of the result.

The details of the neural network used in this study are given in Sec. II. The results of the nuclear mass neural network model and its performance are discussed in Sec. III. Finally, a summary is presented in Sec. IV. All the hyperparameters adjusted in this study are given in Appendices A, B, C, and D.

IV. SUMMARY AND PERSPECTIVES

A DNN is applied to study the nuclear mass based on the FRDM12 model. Different hyperparameters, including activation functions, the learning rate, the number of hidden units, and initializers, are adjusted in a systematic way to achieve better model performance, as shown in Appendices A, B, C, and D. Finally, seven sets of hyperparameters with different regularizers and seed numbers have been selected. It is important that averaging the predictions given by several different sets of hyperparameters can provide not only the average values of mass predictions but also reliable estimations in the mass prediction uncertainties.

With the neural network, the RMS deviations between the experimental and theoretical masses are reduced from $ 0.603 $ to $ 0.200 $ MeV and $ 0.232 $ MeV for the training and validation sets, respectively. For most of the nuclei, this DNN model can give a better mass prediction compared with the FRDM12 model. Even for nuclei that have a poor mass prediction in the FRDM12 model, such as $ ^{41} $Cr and $ ^{42} $Cr, the DNN model can still reduce the RMS deviation and achieve a better mass prediction. However, there are still several nuclei with a worse mass prediction compared with the FRDM12 model, which may occur in the validation set with unsolved odd-even staggering problems. Further studies on these nuclei are required to improve the model, such as to provide a better description of the odd-even staggering in nuclei.

In the future, the same technique can be applied to other physics quantities related to nuclei. With the nuclear mass predicted in this study and by adjusting different hyperparameters, DNN models for different physics quantities, such as β-decay half-life and β-delayed neutron emission probability, can be generated and their performance can be compared with other current theoretical models to investigate whether this algorithm can improve the current predictions.

APPENDIX A: PERFORMANCE WITH DIFFERENT ACTIVATION FUNCTIONS

APPENDIX B: PERFORMANCE WITH DIFFERENT LEARNING RATES

APPENDIX C: PERFORMANCE WITH DIFFERENT NUMBERS OF HIDDEN UNITS

APPENDIX D: PERFORMANCE WITH DIFFERENT INITIALIZERS

Reference (34)

Hyperparameters	$ \sigma_{\rm training} $/MeV	$ \sigma_{\rm validation} $/MeV	$ \Delta\sigma_{\rm training} $ (%)	$ \Delta\sigma_{\rm validation} $ (%)
No regularizer, seed $ =3 $	$ 0.175 $	$ 0.228 $	$ 71.0 $	$ 62.2 $
L2 regularizer, $ \lambda=0.00001 $, seed $ =1 $	$ 0.204 $	$ 0.226 $	$ 66.2 $	$ 62.5 $
L2 regularizer, $ \lambda=0.00001 $, seed $ =2 $	$ 0.216 $	$ 0.235 $	$ 64.2 $	$ 61.0 $
Orthogonal regularizer, $ \lambda=0.1 $, seed $ =2 $	$ 0.212 $	$ 0.218 $	$ 64.8 $	$ 63.8 $
Orthogonal regularizer, $ \lambda=0.01 $, seed $ =1 $	$ 0.176 $	$ 0.238 $	$ 70.8 $	$ 60.5 $
Orthogonal regularizer, $ \lambda=0.001 $, seed $ =1 $	$ 0.219 $	$ 0.238 $	$ 63.7 $	$ 60.5 $
Orthogonal regularizer, $ \lambda=0.001 $, seed $ =2 $	$ 0.201 $	$ 0.238 $	$ 66.7 $	$ 60.5 $
Average of the above sets	$ 0.200 $	$ 0.232 $	$ 66.8 $	$ 61.5 $

Nuclear mass predictions based on a deep neural network and finite-range droplet model (2012)

Abstract：

References

Access

Article Metrics

Metrics

通讯作者: 陈斌, bchen63@163.com

Email This Article

Nuclear mass predictions based on a deep neural network and finite-range droplet model (2012)

HTML

目录