A common problem in a high energy physics experiment is extracting a signal from a much larger background. Posed as a classification task, there is said to be an imbalance in the number of samples belonging to the signal class versus the number of samples from the background class. In this work we provide a brief overview of class imbalance techniques in a high energy physics setting. Two case studies are presented: (1) the measurement of the longitudinal polarization fraction in same-sign $WW$ scattering, and (2) the decay of the Higgs boson to charm-quark pairs.
Cited by 3
Aguilar-Saavedra et al., Mass Unspecific Supervised Tagging (MUST) for boosted jets
J. High Energ. Phys. 2021, 12 (2021) [Crossref]
Aguilar-Saavedra et al., More light on Higgs flavor at the LHC: Higgs boson couplings to light quarks through
Phys. Rev. D 103, 095023 (2021) [Crossref]
Aguilar-Saavedra, Pulling the Higgs and top needles from the jet stack with feature extended supervised tagging
Eur. Phys. J. C 81, 734 (2021) [Crossref]