Citation: Chao Jin, Song-zhan Chen and Hui-hai He. Classifying the Cosmic-Ray Proton and Light Groups on the LHAASO-KM2A Experiment with the Graph Neural Network[J]. Chinese Physics C. doi: 10.1088/1674-1137/44/6/065002
Received: 2019-11-05
Revised: 2020-02-16
Classifying cosmic-ray proton and light groups in LHAASO-KM2A experiment with graph neural network

    Corresponding author: Chao Jin, jinchao@mail.ihep.ac.cn
  • 1. Key Laboratory of Particle Astrophysics, Institute of High Energy Physics, Chinese Academy of Sciences, Beijing 100049, China
  • 2. University of Chinese Academy of Sciences, 19 A Yuquan Rd, Shijingshan District, Beijing 100049, China

Abstract: The precise measurement of cosmic-ray (CR) knees of different primaries is essential to reveal CR acceleration and propagation mechanisms, as well as to explore new physics. However, the classification of CR components is a difficult task, especially for groups with similar atomic numbers. Given that deep learning has achieved remarkable breakthroughs in numerous fields, we seek to leverage this technology to improve the classification performance for the CR proton and light groups in the LHAASO-KM2A experiment. In this study, we propose a fused graph neural network model for the KM2A arrays, where the activated detectors are structured into graphs. We find that the signal and background are effectively discriminated by this model, and its performance outperforms both the traditional physics-based method and the convolutional neural network (CNN)-based model across the entire energy range.


    1.   Introduction
    • With the rapid development of computational resources such as GPUs, deep learning has achieved remarkable progress in numerous fields, such as object detection and classification [1-3], machine translation [4,5], and speech recognition [6-8]. While traditional methods often resolve these issues through handcrafted features based on expert knowledge, deep learning methods learn the internal representation through an end-to-end training paradigm, e.g., convolutional neural networks (CNNs) [9,10] and recurrent neural networks (RNNs) [11-13]. The characteristics of sparse connectivity and parameter sharing make the CNN a powerful engine for analyzing image data, while internal units with loops and states make the RNN efficient for modeling time-dependent series.

      The success of these deep learning methods is partially owed to their effectiveness in extracting the latent representation from regular Euclidean data (i.e., images, text, speech). Presently, there is an increasing demand for effectively analyzing non-Euclidean data with irregular and complex structures. Proposed methods structure such data as graphs and exploit deep learning to learn their representation. For example, in e-commerce and social media platforms, graph-based learning systems exploit interactions between users and products to make highly accurate recommendations [14,15]. In chemistry, molecules are modeled as graphs to explore and identify their chemical properties [16]. In the high-energy physics field, researchers need to analyze large amounts of irregular signals. Consequently, studies seek to improve the analysis efficiency with graph neural networks (GNNs). Impressive progress has been achieved, including improvement of the neutrino detection efficiency at IceCube [17], exploring SUSY particles [18], and recognizing jet pileup structures [19] at the LHC.

      Precise measurement of the cosmic-ray (CR) spectrum and its components at the PeV scale is essential to probe the CR origin, acceleration, and propagation mechanisms, as well as to explore new physics. A spectral break at ~ 4 PeV, referred to as the CR knee, was discovered 60 years ago [20]; however, its origin remains a mystery. Precise localization of the knees of different chemical compositions is key to exploring the hidden physics. Current explanations of the CR knee fall into two categories with different mechanisms: mass-dependent knee models and rigidity-dependent knee models [21]. Rigidity-dependent knee models are often considered to originate from the acceleration limit and the galactic leakage mechanism, while many mass-dependent knee models are associated with new physics, involving a new interaction channel or dark matter, as summarized in Ref. [21]. Although extensive efforts have been made to resolve this issue, the experimental measurements exhibit large discrepancies with each other [22-25].

      The Large High Altitude Air Shower Observatory (LHAASO) is a next-generation CR experiment [26], which aims to precisely measure the CR spectrum and its light components from 10 TeV up to EeV energies and to survey the northern hemisphere for gamma-ray sources with a high sensitivity of 1% Crab units. The observatory is located at a high altitude (4410 m a.s.l.) at the Daocheng site, Sichuan Province, China. It consists of an EAS array (KM2A) covering a 1.3 km2 area, a 78000 m2 close-packed water Cherenkov detector array (WCDA), and 12 wide-field Cherenkov/fluorescence telescopes (WFCTA). The LHAASO-KM2A occupies most of the area and is composed of two sub-arrays: a 1.3 km2 array of 5195 electromagnetic particle detectors (EDs) and an overlapping 1 km2 array of 1171 underground water Cherenkov tanks serving as muon detectors (MDs). The WCDA contains three water ponds with an effective depth of about 4 m. Each pond is divided into 5 × 5 m2 cells, each with an 8-inch PMT located at the bottom to observe the Cherenkov light generated by the EAS secondary particles in the water. The focal-plane camera of each WFCTA telescope has 32 × 32 pixels, each with a size of $ 0.5^{\circ} \times 0.5^{\circ} $.

      The layout of each component of LHAASO is illustrated in Fig. 1, where the red and blue points represent the KM2A-ED and KM2A-MD detectors, respectively. The ED detectors are divided into two parts: a central part with 4901 detectors and an outskirt ring with 294 detectors, which discriminates showers with cores located within the central area from those outside. An ED unit consists of four plastic scintillation tiles (100 × 25 × 1 cm3 each) covered by 5 mm thick lead plates to absorb the low-energy charged particles in showers and convert the shower photons into electron-positron pairs. The MD array plays the key role in discriminating gamma rays from the CR nuclei background, and it offers important information for classifying CR groups. An MD unit has an area of 36 m2 and is buried under 2.5 m of overburden soil to shield against the electromagnetic components of showers. It is designed as a water Cherenkov detector underneath the soil, collecting the Cherenkov light induced by muons as they penetrate the water tank.

      Figure 1.  (color online) Layout of LHAASO experiment. Insets show details of one pond of WCDA, and EDs (red points) and MDs (blue points) of KM2A. WFCTA, located at WCDA edge, are also shown.

      Several studies have addressed the component discrimination of the LHAASO hybrid detection using both expert-designed features [27] and machine learning methods [28]. These hybrid detection methods utilize the effective information offered by the entire set of LHAASO arrays. Although they exhibit remarkable performance, their statistics are limited by the restricted operation time and aperture. Owing to its large area, full duty cycle, and excellent $ \gamma / P $ discrimination ability, the LHAASO-KM2A is an ideal candidate for studying the CR component classification task. In this study, we leverage the GNN to improve the CR-component classification performance in the LHAASO-KM2A experiment, where the detectors activated by an event are formed as graph-structured data. Our previous study [21] showed that this issue requires high accuracy in classifying the CR proton (P task) and light component (L task) against the background; hence, we focus on these two tasks. To evaluate the GNN performance, we introduce the traditional physics-based method with a handcrafted feature as the baseline. The subsequent sections are organized as follows. First, we introduce the physics baseline method in Section 2. Then, we review the development of the GNN framework and propose our KM2A GNN framework in Section 3. We perform the experiment and evaluate the results in Section 4 and Section 5. In the last section, we conclude regarding the GNN performance.

    2.   Physics baseline
    • Current experiments detect high-energy CRs exclusively via indirect methods, which measure the secondary particles of the extensive air showers (EAS) induced by the primary CR nuclei. The shower-to-shower fluctuations make classification of CR primaries difficult. When CR nuclei impinge on the top of the atmosphere, they undergo hadronic interactions with air molecules and generate daughter particles recursively; this is referred to as the hadronic cascade. The sequence of interactions proceeds by the following reaction and decay schemes [29]

      $ \begin{aligned} p + p &\to N + N + n_1 \pi^{\pm} + n_2 \pi^{0} \\ \pi^{0} &\to 2\gamma \\ \pi^{+} &\to \mu^{+} + \nu_{\mu} \quad ({\rm charge\ conjugate},\ c.c.) \\ \mu^{+} &\to e^{+} + \bar{\nu}_{\mu} + \nu_{e} \quad (c.c.) \end{aligned} $

      Here, the photons, electrons, and positrons form most of the electromagnetic part of the EAS and in turn regenerate themselves through pair production $ \gamma \rightarrow e^+ + e^- $ and the bremsstrahlung process $ e^{\pm} \rightarrow e^{\pm} + \gamma $; this is referred to as the electromagnetic cascade. Neutrinos form the missing part of the EAS, which is generally ignored in experiments. The muons do not form a cascade themselves; they have a relatively long lifetime (2.2 $ \mu $s) and comparatively small energy loss in media, such that a large fraction of the muons produced in a shower penetrate the atmosphere and accumulate until they arrive at the observation level.

      The task of classifying CR primary groups relies on the electromagnetic and muon parts of the EAS. In the first-order approximation [30], a primary CR nucleus with mass number A and energy E can be regarded as a swarm of A independent nucleons generating A superimposed proton-induced hadron cascades, each with energy $ E/A $. Because a heavier CR nucleus has less energy per nucleon, it interacts with the air molecules at a higher altitude. Hence, its electromagnetic shower components suffer more attenuation over the longer path length, and its $ \pi^{\pm} $ components have more opportunity to decay into muons. Consequently, the ratio of the electromagnetic to muon parts is a component-sensitive estimator, and it is widely adopted in CR experiments [25].

      Because the LHAASO-KM2A array can discriminate the electron and muon parts of a shower with its ED and MD arrays, we formulate the ratio of the signals collected by the MD and ED arrays, $ N_{\mu} / N_e $, as the physics-based baseline model. $ N_{\mu} $ and $ N_e $ denote the photoelectrons of an event recorded by the activated MD and ED detectors, respectively. The selection criteria are optimized: active ED detectors are counted within a distance of 100 m from the shower core, and active MD detectors are counted within 40 ~ 200 m. The exclusion of the area within 40 m of the core for the MDs serves to eliminate the punch-through effect, where high-energy electromagnetic particles near the shower core can penetrate the soil-shielding layer and fire the MD detectors beneath. We illustrate the distribution of the ratio $ N_{\mu} / N_e $ with respect to CR components and energies in Fig. 2, based on the simulation described in Section 4. As shown, heavier components exhibit larger values of $ N_{\mu} / N_e $, while the proton lies at the bottom.

      Figure 2.  (color online) Distribution of $N_{\mu} / N_e$ for each CR group (P (red), He (violet), CNO (blue), MgAlSi (yellow), Fe (green)) across energies from 100 TeV to 10 PeV.
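      As an illustration of this baseline, a minimal sketch of the $ N_{\mu} / N_e $ estimator for a single reconstructed event is given below; the array names (detector positions, photoelectron counts, core position) are hypothetical, while the distance cuts follow the selection described above.

      import numpy as np

      def mu_e_ratio(ed_pos, ed_pe, md_pos, md_pe, core_xy):
          """Physics-baseline estimator N_mu / N_e for a single event.
          ed_pos, md_pos: (n, 2) detector x-y positions [m];
          ed_pe, md_pe:   (n,) recorded photoelectrons;
          core_xy:        (2,) reconstructed shower-core position [m]."""
          r_ed = np.linalg.norm(ed_pos - core_xy, axis=1)
          r_md = np.linalg.norm(md_pos - core_xy, axis=1)
          n_e = ed_pe[r_ed < 100.0].sum()                      # EDs within 100 m of the core
          n_mu = md_pe[(r_md > 40.0) & (r_md < 200.0)].sum()   # MDs within 40-200 m (punch-through cut)
          return n_mu / n_e if n_e > 0 else np.nan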

    3.   Graph neural network

      3.1.   Graph neural network overview

    • GNN architectures are specialized to effectively analyze graph-structured data. Many of them adopt concepts from convolutional networks and design corresponding graph convolution operations. Based on their graph convolution schemes, most GNN models are classified into two categories: spectral-domain and spatial-domain methods [31]. The spectral methods are formulated based on graph signal processing theory [32,33], where the graph convolution is interpreted as filtering the graph signal on a set of weighted Fourier basis functions. Spatial methods explicitly aggregate the information from the neighbors through the weighted edges.

      Suppose an undirected, connected, weighted graph is denoted as $ G = \{ V, E, A\} $, which consists of a set of vertices $ V $, a set of edges $ E $, and a weighted adjacency matrix $ A $. The spectral-based approach is defined on the normalized graph Laplacian $ L = I - D^{-1/2}AD^{-1/2} $, where $ D $ is the diagonal degree matrix of $ G $. Because the Laplacian $ L $ is a real symmetric positive semidefinite matrix, it can be factored as $ L = U \Lambda U^T $ through eigenvalue decomposition. Hence, the Fourier basis $ F(x) = U^T x $ can be used for graph filtering, and the spectral graph convolution operation is simplified as

      $ \begin{array}{l} x *_{G}\, g_{\theta} = U g_{\theta} U^T x , \end{array} $

      (1)

      where $ g_{\theta} = {\rm diag}(U^T g) $ is the learnable filter.

      Bruna et al. [34] proposed the first spectral convolutional neural network (spectral CNN), with the spectral filter $ g_{\theta} = \Theta^k_{i,j} $ as a set of learnable parameters. Because of the high computational complexity of the Fourier basis U, Defferrard et al. [35] proposed the Chebyshev spectral CNN (ChebNet) by introducing Chebyshev polynomials as the filter, i.e., $ g_{\theta} = \sum^K_{i = 1} \theta_i T_k(\tilde{\Lambda}) $, where $ \tilde{\Lambda} = 2\Lambda / \lambda_{\max} - I_N $. Consequently, ChebNet avoids computation of the graph Fourier basis and significantly reduces the computational complexity. Furthermore, Kipf et al. [36] simplified ChebNet with a first-order approximation by assuming $ K = 1 $ and $ \lambda_{\max} = 2 $. The resulting graph convolution operates entirely in the spatial domain.
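      For concreteness, this first-order simplification leads (with the renormalization of Ref. [36]) to the propagation rule $ \hat{A} X W $ with $ \hat{A} = \tilde{D}^{-1/2}(A + I_N)\tilde{D}^{-1/2} $. A minimal NumPy sketch of one such layer is given below; the array names and the ReLU activation are illustrative.

      import numpy as np

      def gcn_layer(adj, x, weight):
          """First-order graph convolution: ReLU( D^{-1/2} (A + I) D^{-1/2} X W ).
          adj: (n, n) adjacency matrix; x: (n, d_in) node features; weight: (d_in, d_out)."""
          a_hat = adj + np.eye(adj.shape[0])              # add self-loops
          d_inv_sqrt = 1.0 / np.sqrt(a_hat.sum(axis=1))   # degree normalization
          a_norm = a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]
          return np.maximum(a_norm @ x @ weight, 0.0)     # aggregate, transform, activate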

      The spatial-based graph convolution is defined based on the nodes' spatial relations. Following the idea of "correlation with a template", the graph convolution relies on a local system at each node to extract patches. Masci et al. [37] introduced the geodesic CNN (GCNN) framework, which generalizes the CNN to non-Euclidean manifolds. Boscaini et al. [38] formulated it as an anisotropic diffusion process. Monti et al. [39] generalized these spatial-domain networks and proposed mixture model networks (MoNet), a generic framework for deep learning in non-Euclidean domains. In this framework, a spatial convolution layer is given by a template-matching procedure as

      $ \begin{array}{l} (f * g)(x) = \displaystyle\sum\limits^{J}_{j = 1} g_j D_j(x) f . \end{array} $

      (2)

      The patch operator in Eq. (2) assumes the form

      $ \begin{array}{l} D_j(x)f = \displaystyle\sum\limits_{y \in N(x)} \omega_j(u(x,y))f(y), j = 1, \cdots, J \ , \end{array} $

      (3)

      where J represents the dimension of the extracted patch; $ x $ denotes a point in the graph or the manifold, and $ y \in N(x) $ represents the neighbors of $ x $. $ u(x,y) $ associates the node pair with a pseudo coordinate, and $ \omega_j(u) $ is a weighting function parameterized by learnable parameters.

      The definition of the patch operator relates MoNet to other spatial-based graph convolutional models through the choice of the pseudo coordinate $ u(x,y) $ and the weighting function $ \omega_j(u(x,y)) $. Consequently, those spatial-based methods can be considered particular instances of MoNet. In particular, a convenient choice of the weighting function is the Gaussian kernel

      $ \begin{array}{l} \omega_j(u) = \exp \left(-\frac12 (u - \mu_j)^T \Sigma^{-1}_j (u - \mu_j)\right), \end{array} $

      (4)

      where $ \Sigma_j $ and $ \mu_j $ are the learnable $ d \times d $ covariance matrix and $ d \times 1 $ mean vector of a Gaussian kernel, respectively.

      Spectral-based methods have their mathematical foundations in graph signal processing; however, high computational costs are involved in calculating the Fourier transform. Spatial-based methods are intuitive, directly aggregating information from the neighbors, and they have the potential to handle large graphs. Moreover, because a Laplacian-based representation is required for the spectral convolution, a model learned on one graph cannot be applied to a different graph, whereas the spatial-based convolution can be shared across different locations and structures. Because each CR EAS event changes its location, direction, and energy, the spatial-domain method is suitable for analyzing the LHAASO-KM2A experiment.

    • 3.2.   Graph neural network on LHAASO-KM2A

    • LHAASO-KM2A detectors record the arrival time and photoelectron amplitude of shower secondary particles. The distribution of detector photoelectrons with respect to the distance from the shower core roughly obeys the NKG function [40,41], with the densest region located at the shower core, while the distribution of arrival times can be parameterized as a plane perpendicular to the shower direction. Accordingly, we perform a data preprocessing procedure by reconstructing the event to locate the shower core position ($ x_0 $, $ y_0 $) and direction ($ \theta_0 $, $ \phi_0 $). The photoelectrons are normalized to the reconstructed event energy for an energy-invariant representation, denoted as $ pe $. Because the shower geometry is often treated as a slanted, symmetric plane around the shower core, we transform the detector positions ($ x_i $, $ y_i $) into cylindrical coordinates ($ r_i $, $ \phi_i $) with the origin at the shower core. The shower event along the time axis is represented by each detector's time residual $ {d}T_i $, defined as

      $ \begin{array}{l} {d}T_i = T_i - \dfrac{ r_i \cdot r_0 }{c\, \lVert r_0 \rVert} - T_0 , \end{array} $

      (5)

      where $ T_i $ is the time recorded by the detector, and $ T_0 $ is the reference time, defined as the earliest time along the arrival direction among the detectors within 15 m of the shower core. $ r_i $ and $ r_0 $ represent the node position vector and the shower direction vector, respectively, and $ c $ is the speed of light.
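      A simplified sketch of this preprocessing step is given below, assuming the detectors lie in a common horizontal plane so that only the ground-projected arrival direction matters; the array names and the ns/m units are illustrative.

      import numpy as np

      C = 0.2998  # speed of light [m/ns]

      def time_residuals(pos, t, core_xy, theta0, phi0):
          """Time residual dT_i of Eq. (5) for every fired detector.
          pos: (n, 2) detector positions [m]; t: (n,) recorded times [ns];
          core_xy: reconstructed core; theta0, phi0: reconstructed direction [rad]."""
          # ground-plane projection of the shower arrival direction
          r0 = np.array([np.sin(theta0) * np.cos(phi0), np.sin(theta0) * np.sin(phi0)])
          rel = pos - core_xy                                # detector positions relative to the core
          plane_delay = rel @ r0 / C                         # expected delay of a planar shower front
          r = np.linalg.norm(rel, axis=1)
          t0 = np.min(t[r < 15.0] - plane_delay[r < 15.0])   # reference time: earliest corrected time within 15 m
          return t - plane_delay - t0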

      The ED and MD detectors are constructed as independent, weighted, undirected dense graphs, with each node carrying a three-dimensional vector $ [pe_i $, $ dT_i $, $ r_i] $. The collection of these vectors depicts the topology of the event shower. An event graph is shown in Fig. 3 for illustration. As mentioned above, heavier nuclei tend to interact at higher altitudes; thus, their secondary particles suffer more Compton scattering, resulting in flatter shower fronts than for lighter nuclei. These relations are illustrated as the $ pe - r $ and $ dT - r $ distributions in Fig. 4, based on the simulation of Section 4. The three-dimensional feature is normalized for each channel independently. We construct the GNN model similarly to Refs. [17,39]. The $ n \ \times \ n $ adjacency matrix A is defined by applying a Gaussian kernel to the pairwise distance $ \lVert x_i - x_j \rVert $ between the activated detectors, as follows

      Figure 3.  (color online) Graph-structured LHAASO-KM2A detectors activated by a 500-TeV EAS event, where red dots represent EDs, and blue dots represent MDs. Dot size depicts logarithmic scale of recorded photoelectrons.

      Figure 4.  (color online) Relations among three-dimensional vectors. Left panel: $pe - r$ distribution. Right panel: $dT - r$ distribution. CR groups (P (red), He (violet), CNO (blue), MgAlSi (yellow), Fe (green)) are shown for comparison.

      $ \begin{array}{l} d_{ij} = {\rm e}^{- \frac12 (\lVert x_i - x_j \rVert - \mu_t)^2 / \sigma_t^2} , \end{array} $

      (6)

      $ \begin{array}{l} a_{ij} = \frac{d_{ij}}{\displaystyle\sum\limits_{k \in N} d_{ik}}. \end{array} $

      (7)

      In Eq. (7), $ a_{ij} $ is the normalized weight element of the adjacency matrix, and $ N $ represents the set of detectors adjacent to detector $ i $. $ \mu_t $ and $ \sigma_t $ are learnable parameters, which define the locality of the convolutional kernel. In addition, the diagonal elements of the matrix A are set to zero.
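      A minimal PyTorch sketch of Eqs. (6)-(7) is shown below; in the actual model $ \mu_t $ and $ \sigma_t $ would be registered as learnable parameters (e.g., torch.nn.Parameter), which is omitted here for brevity.

      import torch

      def build_adjacency(pos, mu_t, sigma_t):
          """Gaussian-kernel adjacency matrix of Eqs. (6)-(7).
          pos: (n, 2) tensor of activated-detector coordinates;
          mu_t, sigma_t: scalars controlling the locality of the kernel."""
          dist = torch.cdist(pos, pos)                               # pairwise distances ||x_i - x_j||
          d = torch.exp(-0.5 * (dist - mu_t) ** 2 / sigma_t ** 2)    # Eq. (6)
          d.fill_diagonal_(0.0)                                      # zero diagonal elements
          return d / d.sum(dim=1, keepdim=True)                      # Eq. (7): row normalization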

      Before implementing the graph convolution layers, we extract higher-dimensional features from the input vectors through the learnable function shown in Eq. (8), where the $ n \ \times \ 3 $ vertex matrix $ v $ is converted into the $ n \ \times \ d^{(0)} $ matrix $ x^{(0)} $.

      $ \begin{array}{l} x^{(0)} = {\rm ReLu}(W^{(0)} v + b^{(0)}). \end{array} $

      (8)

      Then, we define a sequence of T convolution layers, as shown in Eq. (10). Each convolution layer $ t $ first aggregates the neighbors by multiplication with the adjacency matrix A and expands the vector from $ d^{(t)} $- to the $ 2 d^{(t)} $- dimension. Subsequently, the weighting function is applied to update the vector into the $ d^{(t+1)} $- dimension. The nonlinear activation function $ {\rm ReLu} $ is employed, except in the last convolution layer $ T $.

      $ \begin{array}{l} G{\rm Conv}(x^{(t)}) = W^{(t)} [ x^{(t)}, A x^{(t)} ] + b^{(t)}, \end{array} $

      (9)

      $ \begin{array}{l} {x^{(t+1)} = \begin{cases} {\rm ReLu}(G{\rm Conv}(x^{(t)})), & t+1 < T \\ G{\rm Conv}(x^{(t)}), & t+1 = T \end{cases}} \end{array} $

      (10)

      The graph structure is preserved during the convolution operations. After the last convolution layer, i.e., the Tth layer, we add a global pooling layer to collect features across the entire graph and compress the graph into a size-invariant representation: the $ n \times d^{(T)} $ feature matrix is averaged into a $ 1 \times d^{(T)} $ matrix. The global pooling layer is defined as

      $ \begin{aligned} x_i^{\rm (pool)} = \frac1N \sum\limits_{n \in N} x_{ni}^{(T)}. \end{aligned} $

      (11)

      At the last stage, we employ a linear layer, and logistic regression is applied to evaluate the event score used as the classifier,

      $ \begin{array}{l} y = {\rm sigmoid}( W^{\rm (pool)} x^{\rm (pool)} + b^{\rm (pool)} ) , \end{array} $

      (12)

      where $ x^{\rm (pool)} $ is the $ d^{(T)} $-dimensional feature from the global pooling layer, and $ y $ is the voting score. The activation function $ {\rm sigmoid} $ ensures that the score y lies within the range $ [0,1] $, where a signal-like event approaches 1 and a background-like event approaches 0.

      We construct the GNNs for ED and MD independently and fuse their outputs through the linear layer in Eq. (12), with $ x^{\rm (pool)} $ being a $ 2d^{(T)} $-dimensional vector. The independent GNN models for ED and MD are retained for comparison. The entire GNN architecture is illustrated in Fig. 5; a simplified sketch of the model is given after the figure.

      Figure 5.  (color online) KM2A GNN model. Upper red network represents GNN ED model, and lower blue network represents GNN MD model. Right-most rectangle contains the fusion operation of the two models (GNN ED+MD) and their independent outputs.
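      The following PyTorch sketch summarizes Eqs. (8)-(12) and the fusion step for a single event. The feature widths, the number of convolution layers, and the omission of batching are assumptions made for illustration rather than the exact configuration used in this study.

      import torch
      import torch.nn as nn

      class GraphBranch(nn.Module):
          """One sub-array branch (ED or MD) implementing Eqs. (8)-(11).
          The feature widths in `dims` are illustrative."""
          def __init__(self, d_in=3, dims=(32, 64, 64)):
              super().__init__()
              self.embed = nn.Linear(d_in, dims[0])                 # Eq. (8): 3 -> d(0)
              self.convs = nn.ModuleList(
                  [nn.Linear(2 * dims[t], dims[t + 1]) for t in range(len(dims) - 1)])

          def forward(self, x, adj):
              x = torch.relu(self.embed(x))                         # n x d(0)
              for t, conv in enumerate(self.convs):
                  x = conv(torch.cat([x, adj @ x], dim=-1))         # Eq. (9): W [x, A x] + b
                  if t + 1 < len(self.convs):                       # Eq. (10): no ReLU after the last layer
                      x = torch.relu(x)
              return x.mean(dim=0)                                  # Eq. (11): global mean pooling

      class KM2AGNN(nn.Module):
          """Fused ED+MD model: the two pooled feature vectors are concatenated
          and classified by a single sigmoid unit (Eq. (12))."""
          def __init__(self, d_out=64):
              super().__init__()
              self.ed = GraphBranch(dims=(32, 64, d_out))
              self.md = GraphBranch(dims=(32, 64, d_out))
              self.classifier = nn.Linear(2 * d_out, 1)

          def forward(self, x_ed, adj_ed, x_md, adj_md):
              pooled = torch.cat([self.ed(x_ed, adj_ed), self.md(x_md, adj_md)])
              return torch.sigmoid(self.classifier(pooled))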

    4.   Experiment
    • We employ Monte Carlo simulations to generate event data for training and evaluating the KM2A GNN performance. The primary EAS events are generated with the CORSIKA package using the hadronic model QGSJETII [42]. The KM2A detector simulation is performed with the Geant4 framework [43, 44]. We generate the major CR groups, including proton (P), helium (He), the medium group (CNO), the heavy group (MgAlSi), and iron (Fe). Events are generated in four energy intervals, 10 ~ 100 TeV, 100 TeV ~ 1 PeV, 1 ~ 10 PeV, and 10 ~ 100 PeV, with a spectral index of -2.7. Reconstructed energies from 100 TeV to 10 PeV are considered, which cover most of the CR knee region. For each task, these groups are divided into independent signal and background groups: only P belongs to the signal for the P task, while P&He forms the signal for the L task.

      After reconstruction of the simulated events [26], we further select events according to their reconstructed core locations and directions. Events with the reconstructed shower core inside the KM2A array, at a distance of 200 ~ 500 m from the array center, are selected. We exclude the inner circular area (within 200 m) to suppress the disturbance of the WCDA on the KM2A reconstruction. Furthermore, a reconstructed zenith angle below $ 35^{\circ} $ is required. Consequently, 105732 events remain for the following analysis. We split the selected events into training, test, and evaluation data sets. For data balance, the group ratios of each data set are readjusted to maintain a roughly $ 1:1 $ signal-to-background ratio. The readjusted data sets for each task are listed in Table 1. The data set ratio between the two major energy intervals, 100 TeV ~ 1 PeV and 1 ~ 10 PeV, is around $ 2:1 $.

      data set      P                         L
                    signal      background    signal      background
      train         14635       14595         24358       23733
      test          2875        2831          4754        4713
      evaluation    24921       22994         24921       22994

      Table 1.  Number of signal and background events for each dataset.

      To train the GNN models, we employ supervised learning with the mean squared error (MSE) as the loss function. After each training epoch, the loss is calculated on the test data set to avoid overfitting. The Adam optimizer [45] is used to optimize the model parameters based on adaptive estimation of low-order moments. The training procedure includes two steps: (i) two independent trainings of the GNN ED and MD models with a learning rate of 0.001, and (ii) a subsequent fine-tuning procedure that fuses the ED and MD models together with a learning rate of 0.0001. Training runs for a total of 80 epochs, by which point the model has converged. All code is written in Python using the open-source deep learning framework PyTorch with GPU acceleration. For each model, four identical candidates with different random initializations are trained, and the one with the best performance is selected for further processing, which helps avoid poor local optima.
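      A schematic training loop consistent with the above description is sketched below; the data loaders, their per-event output format, and the logging are assumptions, while the MSE loss, Adam optimizer, and learning rates follow the text.

      import torch

      def train(model, train_loader, test_loader, lr, epochs):
          """Supervised training with the MSE loss and the Adam optimizer.
          Loaders are assumed to yield one event per step as
          (x_ed, adj_ed, x_md, adj_md, label); batching is omitted for brevity."""
          opt = torch.optim.Adam(model.parameters(), lr=lr)
          loss_fn = torch.nn.MSELoss()
          for epoch in range(epochs):
              model.train()
              for x_ed, a_ed, x_md, a_md, y in train_loader:
                  opt.zero_grad()
                  loss = loss_fn(model(x_ed, a_ed, x_md, a_md), y)
                  loss.backward()
                  opt.step()
              model.eval()
              with torch.no_grad():                      # monitor the test loss to avoid overfitting
                  test_loss = sum(float(loss_fn(model(xe, ae, xm, am), y))
                                  for xe, ae, xm, am, y in test_loader)
              print(f"epoch {epoch}: test loss {test_loss:.4f}")

      # Two-stage schedule described in the text:
      # (i)  train the ED-only and MD-only branches with lr = 0.001;
      # (ii) fine-tune the fused ED+MD model with lr = 0.0001, ~80 epochs in total.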

    5.   Results
    • We evaluate the model performance on the evaluation data set for each task. Fig. 6 displays the distribution of output scores; results for the P and L tasks are depicted in the left and right panels, respectively. Intuitively, the shape of the score distribution indicates that classifying the light group is significantly easier than classifying the proton group alone. We calculate the receiver operating characteristic (ROC) curves for explicit comparison. The ROC curves of the physics baseline are integrated over the $ N_{\mu}/N_e $ distribution, while those of the GNN models are integrated over the scores. The results are shown in Fig. 7. The ROC x axis, the false positive rate, indicates the background retention rate; the ROC y axis, the true positive rate, indicates the signal efficiency. Fig. 7 shows that the best performance is obtained with the fused GNN model, whereas the physics baseline model yields the poorest performance. The ED GNN model performs better than the MD GNN, implying that the sparsity of the sub-array has an essential effect.

      Figure 6.  (color online) Distribution of output scores from each model for P task (left) and L task (right).

      Figure 7.  (color online) ROC curves from each model for P task (left) and L task (right).

      To reduce the sensitivity to noise due to the limited size of the data set and to quantitatively evaluate the model performance, we use the area under the ROC curve (AUC) as a performance measure. We further split the data set into a sequence of energy bins to compare the performance across the entire energy range; each order of magnitude in energy is divided into five uniform bins in logarithmic coordinates. The selected events are weighted according to the Horandel model [46] to mimic the actual spectrum for the subsequent analysis. We calculate the AUC values of the models in each bin and plot them in Fig. 8. The results confirm the conclusions stated above; furthermore, they show that the fused GNN model outperforms the physics baseline at all energies. The averaged AUC values are listed in Table 2. The fused GNN model achieves the highest score, with 0.878 for the P task and 0.959 for the L task. The AUC score of the L task consistently exceeds that of the P task by a considerable amount, about 0.068 for the physics baseline and up to 0.081 for the fused GNN model. Because the atomic numbers of proton and helium are close, it is difficult to discriminate the proton from the helium background.
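      The per-bin AUC evaluation can be sketched as follows, assuming arrays of model scores, true labels, reconstructed $ \log_{10} $ energies, and optional Horandel weights; scikit-learn's roc_auc_score is used here for convenience and is not necessarily the tool used in this study.

      import numpy as np
      from sklearn.metrics import roc_auc_score

      def auc_vs_energy(scores, labels, log10_e, bins, weights=None):
          """AUC evaluated in uniform logarithmic energy bins.
          scores, labels, log10_e, weights: per-event arrays; bins: bin edges in
          log10(E), e.g. np.linspace(5, 7, 11) for five bins per decade (E in GeV)."""
          aucs = []
          for lo, hi in zip(bins[:-1], bins[1:]):
              sel = (log10_e >= lo) & (log10_e < hi)
              w = None if weights is None else weights[sel]
              aucs.append(roc_auc_score(labels[sel], scores[sel], sample_weight=w))
          return np.array(aucs)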

      model           P           L
      baseline        0.836       0.904
      GNN MD          0.847       0.93
      GNN ED          0.861       0.936
      GNN ED+MD       0.878       0.959

      Table 2.  Average AUC scores.

      Figure 8.  (color online) AUC values across energy range from 100 TeV to 10 PeV from each model for P task (left) and L task (right).

      With regard to the real measurement, the significant quantity extracted from the ROC curves is the purity, which serves as a criterion for subtracting background contamination [22]. The derived purities are shown in Table 3, evaluated at the same selection efficiency as the LHAASO hybrid detection methods [27,28] for comparison. The results demonstrate that the GNN method yields state-of-the-art performance among KM2A-only methods and performs comparably to the hybrid detection, except for a slight deficiency in the P task. The hybrid detection methods, including handcrafted features [27] and the gradient boosted decision tree (GBDT) [28], employ latent representations from WCDA, KM2A, and WFCTA in the LHAASO experiment under stricter selection criteria. This achieves high performance, but causes a significant loss of statistics. We also show the apertures of the KM2A-only and hybrid detection methods in Table 3; the apertures are derived from the selection criteria. The KM2A-only methods can achieve an aperture about 87 times larger than the hybrid detections. Considering the strict observation conditions of the WFCTA, with only ~ 10% duty cycle [27], the total statistics of the KM2A are expected to be on the order of 870 times larger than those of the hybrid detection. We illustrate the expected observation from a one-day operation of the KM2A on the proton- and light-group spectra in Fig. 9, where the rigidity- and mass-dependent knee models are adopted from Refs. [21,47]. As demonstrated in Refs. [21,48], the spectral blur from the measurement preserves the spectral slope and knee position. Hence, the input spectra are adopted as the observation in Fig. 9, with emphasis on the statistical error bars.

      model                      Purity (%) (±stat.±sys.)            Aperture (${\rm m^2 \cdot sr}$) (±stat.±sys.)
                                 P               L                   P                        L
      handcraft (hybrid) [27]    ~90             ~95                 ~1.5e3                   ~4e3
      GBDT (hybrid) [28]         ~90             ~97                 ~3.6e3                   ~7.2e3
      baseline (KM2A)            73.4±2.5±2.4    93.2±0.9±1.1        3.2e5±1.3e3±1.0e4        6.3e5±2.7e3±7.6e3
      CNN (KM2A)                 75.4±2.5±2.4    93.3±0.9±1.1        3.2e5±1.3e3±1.0e4        6.3e5±2.7e3±7.6e3
      GNN MD (KM2A)              77.1±2.3±2.5    95.9±0.6±1.2        3.2e5±1.3e3±1.0e4        6.3e5±2.7e3±7.6e3
      GNN ED (KM2A)              82.8±1.9±2.6    96.6±0.6±1.2        3.2e5±1.3e3±1.0e4        6.3e5±2.7e3±7.6e3
      GNN ED+MD (KM2A)           84±1.9±2.7      98.2±0.4±1.2        3.2e5±1.3e3±1.0e4        6.3e5±2.7e3±7.6e3

      Table 3.  Signal purity and aperture results of each model in the LHAASO experiment.

      Figure 9.  (color online) Expectation on proton- and light-group spectra measured by LHAASO-KM2A with one-day observation. Triangular markers represent spectra predicted by one of the rigidity-dependent knee models (Z) [47], and square markers represent spectra predicted by one of the mass-dependent knee models (A) [21].

      We further construct a simple CNN model for comparison with the GNN model. The entire ED and MD arrays are rescaled into regular grids, with $ (85 \times 97) $ pixels for the ED and $ (40 \times 46) $ pixels for the MD. Activated detectors are filled into the corresponding grid cells, with the others remaining zero. We construct the CNN model with a series of convolution modules for the ED and MD images and fuse their outputs through a linear layer as the classifier. Its performance is also shown in Table 3. The CNN exhibits poor performance, which we attribute to its insufficient ability to handle the large variance of the EAS (tens to ~1000 activated detectors) and the inefficient representation of the image structure (zero-valued grid cells $ \gtrsim $ 90%). In addition, because the graph convolutional kernel in Eqs. (6) and (7) is a Gaussian function with only two learnable parameters ($ \mu_t $, $ \sigma_t $), whereas the number of parameters in a CNN convolutional layer is $ C_{\rm out} \times C_{\rm in} \times K_t^2 $, the training efficiency of the CNN is far lower than that of the GNN.

      To evaluate systematic errors, we consider the influence of two aspects. The first contribution is the hadronic model selected for generating the Monte Carlo simulation data. From research on the LHAASO-KM2A prototype array, the difference between the hadronic models (QGSJETII and EPOS) is roughly 5% with regard to the secondary particles [49]. Hence, we apply a variance of 5% to the recorded $ pe $ when estimating the systematic errors. The resulting errors are within 2.4% for the P task and 0.4% for the L task. The second contribution arises because, in the actual operation of the KM2A experiment, a detector may randomly drop out and thus not fire when a CR event is recorded. As this effect does not occur in the simulation, it introduces systematic errors in the inference on real data. Supposing that 5% of the detectors may drop out during actual operation, we randomly remove them from the fired detectors in the simulation data and re-evaluate the model performance. The results show that this effect causes errors of approximately 2.1% for the P task and 1.1% for the L task. This analysis demonstrates the stability of the GNN method. Both the statistical and systematic errors are shown in Table 3. In future studies, when LHAASO experimental data are acquired, a detailed comparison between the simulation and real data will be necessary to estimate other systematic errors.
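      The detector drop-out test can be emulated with a small helper such as the one below; the per-event dictionary layout is an assumption made for illustration.

      import numpy as np

      def drop_detectors(event, frac=0.05, rng=np.random.default_rng()):
          """Randomly remove a fraction of the fired detectors from a simulated event
          to mimic detectors that are off during real data taking.
          `event` is assumed to be a dict of equal-length per-detector arrays (pe, dT, r, ...)."""
          n = len(next(iter(event.values())))
          keep = rng.random(n) > frac                      # drop ~5% of the fired detectors
          return {key: val[keep] for key, val in event.items()}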

    6.   Conclusion
    • Deep learning has contributed to significant progress in numerous fields. Therefore, we leverage this technology to improve the classification performance in the LHAASO-KM2A experiment. We propose a fused GNN model, which constructs independent networks for the KM2A ED and MD arrays and fuses their outputs for classification. This model is demonstrated to be effective, and it outperforms the traditional physics-based method as well as the CNN-based method over the entire energy range. Furthermore, we compare the performance of the GNN framework on the independent ED and MD arrays; the ED array is found to perform better than the MD array, which we attribute to the higher-density configuration of the ED array. Moreover, in comparison with the LHAASO hybrid detection methods, our KM2A GNN model exhibits competitive classification performance. Owing to the large area and full duty cycle of the KM2A array, it can acquire statistics on the order of ~870 times higher than the hybrid detection.

      We thank the LHAASO Collaboration for their support on this project.
