In this section, four 9-category classifications of all accessible Higgs decay final states, one for each Z decay mode, are performed with the PFN method, and their confusion matrices are determined. As a preliminary attempt, a more ambitious 39-category classification is carried out with ParticleNet, and promising, consistent results are achieved.
For the two ML models used to classify the Higgs decays, the energy, polar angle, and azimuthal angle of each reconstructed particle are always provided as kinematic inputs. Note that energies and polar angles are used instead of the transverse momenta and rapidities of the original studies [26, 27], since in this study the models are applied to $e^+e^-$ collider experiments. The inputs also include the PID and impact parameters of charged particles.

The PFN architecture [26] parameterizes the functions Φ and F in a sufficiently general way, using several dense neural-network layers as universal approximators. For Φ, three dense layers with 100, 100, and l nodes are employed, where the latent dimension l is set to 256 after comparing the performance of 128 and 256. For F, the same configuration as in the original paper is used: three dense layers, each with 100 nodes. Each dense layer uses the
$ {\rm{ReLU}} $ activation function and He-uniform parameter initialization [33]. The output layer has nine units (one per class) with a $ {\rm{SoftMax}} $ activation function.

The ParticleNet [27] architecture consists of three EdgeConv blocks, one aggregation layer, and two fully connected layers. The first EdgeConv block uses the spatial coordinates of the particles in the $ \theta-\phi $ space to compute the distances, while the subsequent blocks use the learned feature vectors as coordinates. The number of nearest neighbors k is 16 for all three blocks, and the number of channels C for each EdgeConv block is (64, 64, 64), (128, 128, 128), and (256, 256, 256), respectively. After the EdgeConv blocks, a channel-wise global average pooling operation is applied to aggregate the learned features over all particles in the cloud. This is followed by a fully connected layer with 256 units and the ReLU activation. A dropout layer with a drop probability of 0.1 is included to prevent overfitting. A fully connected layer with 39 units, followed by a SoftMax function, generates the output for the 39-category classification task.
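As an illustration of the permutation-invariant structure described above, the PFN forward pass (per-particle map Φ, summation over particles, event-level map F, SoftMax output) can be sketched in plain numpy. The layer widths (100, 100, 256) and (100, 100, 100) follow the text; the weights here are random He-uniform-style placeholders rather than trained parameters, and the feature count is an arbitrary stand-in for the actual inputs.

```python
import numpy as np

rng = np.random.default_rng(0)

def dense(x, w, b, act=True):
    """One dense layer; ReLU activation unless act=False."""
    y = x @ w + b
    return np.maximum(y, 0.0) if act else y

def softmax(z):
    e = np.exp(z - z.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def make_layer(n_in, n_out):
    # He-uniform-style initialization, as mentioned in the text [33]
    limit = np.sqrt(6.0 / n_in)
    return rng.uniform(-limit, limit, (n_in, n_out)), np.zeros(n_out)

N_FEAT, LATENT, N_CLASS = 6, 256, 9   # per-particle features (illustrative), latent dim l, classes
phi = [make_layer(a, b) for a, b in [(N_FEAT, 100), (100, 100), (100, LATENT)]]
f   = [make_layer(a, b) for a, b in [(LATENT, 100), (100, 100), (100, 100)]]
out = make_layer(100, N_CLASS)

def pfn_forward(particles):
    """particles: (n_particles, N_FEAT) array for one event."""
    x = particles
    for w, b in phi:                      # Phi: applied to each particle independently
        x = dense(x, w, b)
    latent = x.sum(axis=0)                # permutation-invariant sum over particles
    y = latent
    for w, b in f:                        # F: event-level network
        y = dense(y, w, b)
    w_o, b_o = out
    return softmax(dense(y, w_o, b_o, act=False))  # 9 class probabilities

parts = rng.normal(size=(30, N_FEAT))     # a toy 30-particle event
scores = pfn_forward(parts)
```

Because the particles are aggregated by a sum, the output is independent of the particle ordering, which is the key design property of the PFN.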
In this study, there are 4 production modes of the Higgs boson at 240 GeV to be analyzed, i.e., $e^+e^- \to e^+e^-H$, $\mu^+\mu^-H$, $\tau^+\tau^-H$, and $q\bar{q}H$. In each production mode, the same 9 decay modes are measured: $H \to c\bar{c}$, $b\bar{b}$, $\mu^+\mu^-$, $\tau^+\tau^-$, $gg$, $\gamma\gamma$, $ZZ^*$, $WW^*$, and $\gamma Z$. So there are 36 processes in total. For each process, 400,000 events are generated with WHIZARD 1.9.5 [34] and fed to Pythia6 [35] for hadronization, where the decays of most intermediate particles, such as the W, Z, and τ, are also simulated by Pythia6 according to its default configuration, and the branching fractions of the Higgs are customized based on Table 1. All the cross sections and decay branching fractions used in this study are summarized in Table 1. The organization and description of the 4-fermion backgrounds are complicated and require a sophisticated scheme; here $ZZ$ denotes processes in which the 4 fermions are produced via two (virtual) neutral vector bosons. More details can be found in Ref. [36]. It should also be noted that the sequential decays of the W and Z are not treated specifically, to avoid complexity, though using more decay knowledge could enhance the classification performance.

Table 1. Standard model predictions of the cross sections at 240 GeV and the decay branching fractions of a 125 GeV Higgs boson, with their irreducible backgrounds at the CEPC.

  Mode                               Cross section or branching fraction
  $\sigma(e^+e^-\to e^+e^-H)$        7.04 fb
  $\sigma(e^+e^-\to \mu^+\mu^-H)$    6.77 fb
  $\sigma(e^+e^-\to \tau^+\tau^-H)$  6.75 fb
  $\sigma(e^+e^-\to q\bar{q}H)$      136.81 fb
  $\sigma(e^+e^-\to ZZ_{l})$         67.81 fb
  $\sigma(e^+e^-\to ZZ_{sl})$        516.67 fb
  $\sigma(e^+e^-\to ZZ_{h})$         556.49 fb
  $B(H\to c\bar{c})$                 2.91%
  $B(H\to b\bar{b})$                 57.7%
  $B(H\to \mu^+\mu^-)$               $2.19\times 10^{-4}$
  $B(H\to \tau^+\tau^-)$             6.32%
  $B(H\to gg)$                       8.57%
  $B(H\to \gamma\gamma)$             $2.28\times 10^{-3}$
  $B(H\to WW^*)$                     21.5%
  $B(H\to ZZ^*)$                     2.64%
  $B(H\to Z\gamma)$                  $1.53\times 10^{-3}$
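The signal cross section of any of the 36 processes follows from Table 1 as the product of a production cross section and a Higgs branching fraction. A small sketch of this bookkeeping (the dictionary keys are illustrative names, not notation from the paper):

```python
# Cross sections (fb) and branching fractions taken from Table 1
xsec = {"eeH": 7.04, "mumuH": 6.77, "tautauH": 6.75, "qqH": 136.81}
br = {"cc": 0.0291, "bb": 0.577, "mumu": 2.19e-4, "tautau": 0.0632,
      "gg": 0.0857, "gamgam": 2.28e-3, "ZZ*": 0.0264, "WW*": 0.215,
      "Zgam": 1.53e-3}

def sigma_times_br(prod, decay):
    """Signal cross section (fb) for one production mode and Higgs decay."""
    return xsec[prod] * br[decay]

# e.g. the dominant fully hadronic channel: 136.81 fb * 57.7%
qqh_bb = sigma_times_br("qqH", "bb")
```

Multiplying by an integrated luminosity (not specified in this section) would then give the expected event yield per process.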
All the generated samples are simulated in a simplified way to model detector responses. In detail, all particles are simulated according to the performance of the baseline detector in the CEPC CDR [9]. The momentum resolution of charged tracks is
$\frac{\sigma(p_t)}{p_t} = 2\times 10^{-5}\oplus \frac{0.001} {p\sin^{3/2}\theta}~ [\mathrm{GeV}^{-1}] .$
The energy resolution of photons is
$\frac{\sigma(E)}{E} = 0.01 \oplus \frac{0.20}{\sqrt{E/\mathrm{(GeV)}}},$
and that of neutral hadrons is
$\frac{\sigma(E)}{E} = 0.03 \oplus \frac{0.50}{\sqrt{E/\mathrm{(GeV)}}},$
and all the reconstruction efficiencies are assumed to be 100% in the simulation. The impact parameters and particle identification are taken directly from the generator truth. While this simplified simulation is somewhat idealized, it is sufficient for a feasibility study. The performance is expected to degrade in full simulation, since the impact parameters are crucial for the separation among the Higgs hadronic decays, especially $b\bar{b}/c\bar{c}/gg$.
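The resolution functions above can be applied as Gaussian smearing of the true quantities, with $\oplus$ denoting addition in quadrature. The sketch below illustrates this simplified-simulation idea for the calorimeter resolutions; it is not the actual CEPC simulation software, and the function names are our own.

```python
import numpy as np

rng = np.random.default_rng(42)

def quad(a, b):
    """a ⊕ b: sum in quadrature."""
    return np.sqrt(a**2 + b**2)

def photon_energy_resolution(E):
    """sigma(E)/E = 0.01 ⊕ 0.20/sqrt(E/GeV), as in the text."""
    return quad(0.01, 0.20 / np.sqrt(E))

def neutral_hadron_energy_resolution(E):
    """sigma(E)/E = 0.03 ⊕ 0.50/sqrt(E/GeV), as in the text."""
    return quad(0.03, 0.50 / np.sqrt(E))

def smear_energy(E, rel_res):
    """Gaussian smearing of a true energy with relative resolution rel_res(E)."""
    return E * (1.0 + rel_res(E) * rng.standard_normal(np.shape(E)))

# Smear a sample of 25 GeV photons; the spread should match the formula
E_true = np.full(100_000, 25.0)
E_reco = smear_energy(E_true, photon_energy_resolution)
rel = E_reco.std() / E_true[0]
```

At 25 GeV this gives a relative photon resolution of about 4.1%, dominated by the stochastic term.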
The Particle Flow Network and ParticleNet are implemented and run on 8 Intel Xeon Gold 6240 CPU cores and 8 NVIDIA Tesla V100-SXM2-32GB GPU cards at the IHEP GPU farm. Both networks are trained with the categorical cross-entropy loss function, the Adam optimization algorithm [37], a batch size of 1000, and a learning rate of 0.001. 400,000 events are used for each production mode, so the total number of events for the 9 decays is 3,600,000. The full data set is split into training, validation, and test samples in the ratio 8:1:1. Monitoring the loss and accuracy on the training and validation samples shows that the models converge well and that there is no obvious over-training after 100 epochs of training; see Fig. 1 as an example.

Figure 1. (color online) Accuracy and loss versus the number of epochs for the $e^+e^- \to e^+e^- H$ process during training.

The computational costs of the two architectures can be estimated and compared. Only the total GPU and CPU consumption is used for the comparison, because the computing resources can only be accessed indirectly via a workload-manager server. Taking the 9-category classification as an example, ParticleNet takes about 347 minutes for training (40 epochs) and 4 minutes for inference, while PFN takes only about 76 minutes for training (100 epochs) and inference. Both architectures can thus be run on a reasonable time scale, although PFN is much faster than ParticleNet. This is consistent with the results in Ref. [27].
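The 8:1:1 partition of the data set can be realized with a simple shuffled index split. This is a minimal sketch; the shuffle seed and the assumption that splitting happens at the index level are ours.

```python
import numpy as np

def split_811(n_events, seed=0):
    """Return index arrays for an 8:1:1 train/validation/test split."""
    idx = np.random.default_rng(seed).permutation(n_events)
    n_train = int(0.8 * n_events)
    n_val = int(0.1 * n_events)
    return idx[:n_train], idx[n_train:n_train + n_val], idx[n_train + n_val:]

# 9 decays x 400,000 events per production mode = 3,600,000 events in total
train, val, test = split_811(3_600_000)
```

The three index sets are disjoint and together cover every event exactly once.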
The outputs of the classifier, produced by a nine-unit layer with a $ {\rm{SoftMax}} $ activation function, are visualized in various ways. The $ {\rm{SoftMax}} $ is essential because it converts the network outputs of each event into 9 probabilities, proportional to the exponentials of the inputs, which serve as input for a cut-based data analysis. Figure 2 presents the 9 scores for each category. Taking the bottom left panel as an example, these events are of $H\to ZZ^*$, and the curves in different colors represent the probability distributions for identifying $H\to ZZ^*$ as each of the processes. The blue curve peaks as the score approaches 1, which means the classifier can identify $H\to ZZ^*$ signals. There are two small peaks in the blue and brown curves around 0.8, which shows that $H\to ZZ^*$ and $H\to \gamma Z$ can contaminate each other because of the similarity of their cascade decays. As Fig. 2 shows, high-dimensional data are difficult to visualize intuitively; a better way is to plot the data in lower dimensions to reveal the inherent structure. To aid visualization of the structure of the 9 outputs, the t-SNE [38] method is used. Figure 3 shows the distribution of the two largest components after the dimensionality reduction, where labels 1–9 represent the 9 decay modes of the Higgs boson, from $c\bar{c}$ to $\gamma Z$, in the same order as above. The patterns in Fig. 3 are consistent with those in Fig. 2 but much clearer. The $\mu^+\mu^-$ (3), $\gamma\gamma$ (6), $\tau^+\tau^-$ (4), and $\gamma Z$ (9) modes form almost isolated clusters, and their backgrounds are rather low in this simplified case. The clusters of the other modes can also be seen, but their overlaps are significant.

Figure 2. (color online) The distributions of the 9 outputs for each true category, taking $e^+e^-H$ as an example. Each score is calculated by assuming that the event belongs to that category.

Figure 3. (color online) Classification performance on the test set visualized with t-SNE, where the two largest components are used, taking 10,000 events of
$ e^+e^-\to e^+e^-H $ for illustration.

Several standard quantities measure the performance of the classifiers. The efficiency (EFF) measures the fraction of correctly classified observations; the ROC (receiver operating characteristic) curve visualizes the true positive rate (TPR) versus the false positive rate (FPR); and the AUC (area under the curve) is the area under the ROC curve. The better the classification at each threshold value, the larger this area, and a perfect classifier has an AUC of 1.0. The EFF and AUC for all 36 processes in the 4 tagging modes are summarized in Table 2. Several conclusions can be drawn from the table. First, the accuracy reaches about 87%, which is adequate for further analysis. The decays $H\to \mu^+\mu^-$, $\tau^+\tau^-$, and $\gamma\gamma$ have the best efficiencies and largest AUCs, as expected. Last but not least, the efficiencies of $H\to ZZ^*$ and $WW^*$ are not as good as the others. The main reason is the similarity between these two decays; in addition, small fractions of $b\bar{b}$, $c\bar{c}$, and $gg$ events can fake $WW^*/ZZ^*$. This leaves room for further improvement.

Table 2. Efficiencies (EFF) and AUCs of the four classifiers.

  Decay mode            $e^+e^-H$       $\mu^+\mu^- H$  $\tau^+\tau^- H$  $q\bar{q}H$
                        EFF    AUC      EFF    AUC      EFF    AUC        EFF    AUC
  $H\to c\bar{c}$       0.880  0.991    0.882  0.991    0.857  0.987      0.755  0.966
  $H\to b\bar{b}$       0.908  0.994    0.893  0.994    0.877  0.991      0.733  0.972
  $H\to \mu^+\mu^-$     0.997  1.000    0.986  1.000    0.981  1.000      0.983  1.000
  $H\to \tau^+\tau^-$   0.993  0.999    0.985  0.999    0.985  0.999      0.982  0.999
  $H\to gg$             0.810  0.985    0.830  0.986    0.816  0.982      0.736  0.954
  $H\to \gamma\gamma$   0.997  1.000    0.999  1.000    1.000  1.000      0.997  1.000
  $H\to ZZ^*$           0.650  0.958    0.667  0.960    0.585  0.947      0.535  0.926
  $H\to WW^*$           0.806  0.981    0.801  0.981    0.771  0.974      0.632  0.952
  $H\to \gamma Z$       0.921  0.996    0.936  0.996    0.910  0.993      0.896  0.993
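The AUC values quoted in Table 2 can be computed per class in a one-vs-rest manner from that class's score. A minimal implementation, using the rank-based (Mann-Whitney) equivalence between the area under the ROC curve and the probability that a signal event outscores a background event:

```python
import numpy as np

def roc_auc(scores, labels):
    """AUC for a binary problem: the probability that a randomly chosen
    signal event scores higher than a randomly chosen background event
    (ties count one half). Equivalent to the area under the ROC curve."""
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=bool)
    sig, bkg = scores[labels], scores[~labels]
    wins = (sig[:, None] > bkg[None, :]).sum()
    ties = (sig[:, None] == bkg[None, :]).sum()
    return (wins + 0.5 * ties) / (len(sig) * len(bkg))

# Toy example: 2 background and 2 signal events
auc = roc_auc([0.1, 0.4, 0.35, 0.8], [0, 0, 1, 1])
```

The pairwise formulation is quadratic in the sample size and is meant only to make the definition explicit; a production analysis would use a sorted, threshold-scan implementation.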
Finally, the confusion matrices are used to evaluate the performance of the ML models and serve as an important ingredient for further data analysis. They are calculated by comparing the predictions of the model with the true labels. Figure 4 shows the confusion matrices of the four classifiers. In a confusion matrix, the efficiencies appear as the diagonal elements and the off-diagonal elements represent the misclassification rates, so the confusion matrices contain the complete information on both correct and incorrect classifications, which can be used to unfold the generated numbers of signals, $N_i$.
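The unfolding idea can be sketched with a toy 3-category example. The row-normalized confusion matrix plays the role of a response matrix: the observed counts per predicted category are a linear mixture of the generated numbers $N_i$, so the $N_i$ can be recovered by solving a linear system. The matrix values below are illustrative, not taken from Fig. 4.

```python
import numpy as np

def confusion_matrix(true, pred, n_class):
    """Row-normalized confusion matrix M[i, j] = P(predicted j | true i)."""
    m = np.zeros((n_class, n_class))
    for t, p in zip(true, pred):
        m[t, p] += 1
    return m / m.sum(axis=1, keepdims=True)

# Toy 3-category response matrix (rows: true, columns: predicted)
M = np.array([[0.90, 0.08, 0.02],
              [0.05, 0.85, 0.10],
              [0.03, 0.07, 0.90]])
N_true = np.array([1000.0, 500.0, 2000.0])

# Observed counts per predicted category: n_obs_j = sum_i N_i * M[i, j]
n_obs = N_true @ M

# Unfolding: recover the generated numbers N_i from the observed counts
N_unfolded = np.linalg.solve(M.T, n_obs)
```

In a real analysis the observed counts carry statistical fluctuations, so the solve would be replaced by a fit with uncertainty propagation, but the linear-algebra core is the same.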
The above study shows that multicategory classification is very promising for data analysis, so a more ambitious case of 39-category classification is attempted here. For the signal processes, the Z decays into 4 categories (neglecting the neutrino decays and W-fusion processes for now), i.e., $e^+e^-$, $\mu^+\mu^-$, $\tau^+\tau^-$, and $q\bar{q}$, and the Higgs has the same 9 decay modes as above, so there are 36 signals. For a realistic analysis, the backgrounds must be taken into account, especially the irreducible ones. In the $e^+e^- \to ZH$ analysis, the irreducible backgrounds mainly come from the SM process $e^+e^- \to ZZ$. This background can be categorized into 3 classes depending on the decays of the Z bosons, i.e., pure leptonic ($ZZ_{l}$), semi-leptonic ($ZZ_{sl}$), and hadronic ($ZZ_{h}$) decays. Overall, it is a 39-category classification problem.

The same signal data sets, together with the 3 extra background processes, are pre-processed with the same procedure, giving 39 $\times$ 400,000 = 15,600,000 events in total, which is very challenging in terms of memory usage. We therefore switch to another deep-learning framework, ParticleNet/Weaver [27, 39], which has a more flexible memory strategy.

The confusion matrix of the 39-category classification is presented in Fig. 5, which shows very good separation power among all 39 processes. For the signal, the four blocks of the $e^+e^-H$, $\mu^+\mu^-H$, $\tau^+\tau^-H$, and $q\bar{q}H$ processes can be seen clearly, demonstrating patterns similar to the corresponding processes in Fig. 4. In each sub-matrix, the $H\to \gamma\gamma$, $\mu^+\mu^-$, and $\tau^+\tau^-$ decays achieve the best performances. Between the four blocks, the misclassification rates are rather small. The $H\to ZZ^*$ decay does not perform as well as the other decays, which is also consistent with the results of the 9-category classification. For the irreducible backgrounds of $e^+e^- \to ZZ$, all three processes are labeled correctly with very high efficiencies, greater than 90%, which indicates that ParticleNet can learn the kinematics of the different events to discriminate the irreducible backgrounds.

In this 39-category classification, all 9 Higgs decays in the 4 tagging modes, together with the irreducible backgrounds, are classified with rather good accuracy. In contrast to the single-tagging-mode case, this indicates that the Higgs decays can be determined with a combined method using much more information.
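The 39 categories described above combine the 4 Z tagging modes with the 9 Higgs decay modes and append the 3 irreducible $ZZ$ background classes. A bookkeeping sketch (the label strings are our own shorthand):

```python
z_modes = ["ee", "mumu", "tautau", "qq"]
h_decays = ["cc", "bb", "mumu", "tautau", "gg", "gamgam", "ZZ*", "WW*", "Zgam"]
zz_bkg = ["ZZ_l", "ZZ_sl", "ZZ_h"]

# 4 x 9 = 36 signal categories followed by the 3 irreducible backgrounds
labels = [f"{z}H_{h}" for z in z_modes for h in h_decays] + zz_bkg
```

The ordering (signal blocks by tagging mode, backgrounds last) mirrors the block structure visible in the confusion matrix of Fig. 5.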
Classify the Higgs decays with the PFN and ParticleNet at electron–positron colliders
- Received Date: 2022-07-07
- Available Online: 2022-11-15
Abstract: Various Higgs factories are proposed to study the Higgs boson precisely and systematically in a model-independent way. In this study, the Particle Flow Network and ParticleNet techniques are used to classify the Higgs decays into multiple categories, and the ultimate goal is to realize an "end-to-end" analysis. A Monte Carlo simulation study is performed to demonstrate the feasibility, and the performance looks rather promising. This result could be the basis of a "one-stop" analysis measuring all the branching fractions of the Higgs decays simultaneously.