SciPost logo

SciPost Submission Page

Optimal Mass Variables for Semivisible Jets

by Kevin Pedro, Prasanth Shyamsundar

This is not the latest submitted version.

This Submission thread is now published as

Submission summary

Authors (as registered SciPost users): Kevin Pedro · Prasanth Shyamsundar
Submission information
Preprint Link: https://arxiv.org/abs/2303.16253v2  (pdf)
Code repository: https://github.com/kpedro88/evn_svj_public
Date submitted: 2023-04-11 00:21
Submitted by: Pedro, Kevin
Submitted to: SciPost Physics
Ontological classification
Academic field: Physics
Specialties:
  • High-Energy Physics - Phenomenology
Approaches: Computational, Phenomenological

Abstract

Strongly coupled hidden sector theories predict collider production of invisible, composite dark matter candidates mixed with regular hadrons in the form of semivisible jets. Classical mass reconstruction techniques may not be optimal for these unusual topologies, in which the missing transverse momentum comes from massive particles and has a nontrivial relationship to the visible jet momentum. We apply the artificial event variable network, a semisupervised, interpretable machine learning technique that uses an information bottleneck, to derive superior mass reconstruction functions for several cases of resonant semivisible jet production. We demonstrate that the technique can extrapolate to unknown signal model parameter values. We further demonstrate the viability of conducting an actual search for new physics using this method, by applying the learned functions to standard model background events from quantum chromodynamics.

Current status:
Has been resubmitted

Reports on this Submission

Report #2 by Anonymous (Referee 2) on 2023-7-12 (Invited Report)

  • Cite as: Anonymous, Report on arXiv:2303.16253v2, delivered 2023-07-12, doi: 10.21468/SciPost.Report.7495

Strengths

1. Exploration of strongly interacting dark sector has received a lot of attention lately, however the experimental searches are often limited by similarity of the signal with dominant multijet background. So any step towards improving discrimination is a step in the right direction.
2. Rather than restricting themselves to the most commonly studied Z' mediator, authors also considered two other production modes.
3. The paper did show possible gain in sensitivity.
4. I would also commend the authors for putting the code on github for scrutiny and use by the community.

Weaknesses

1. This is not so much a weakness of this paper per se, but as it stands, this study is somewhat Pythia8 HV model, and specific parameter choice dependent.
2. It is not obvious a priori that what EVN is outputting is mass.
3. It is slightly concerning that only 6000 or 12000 events per signal point were generated/used.
4. The network/result depends possibly on rinv.
5. Its not clear, if there is any dependence on MET or jet pT.

Report

Firstly I would like to apologise for the delay. June/July is a busy month with meetings and holidays.

This is a useful study, and as such I think deserves to be published. Some details can be clarified a bit more to improve the quality of the paper. Some suggestions are below.

Requested changes

1. The paper seems to be missing citing some other recent SVJ work, especially the ATLAS result and other studies about characteristics of SVJ and use of ML methods.
2. It would be appreciated if the authors explain a bit more how EVN output is the mass, not just a mass-like observable.
3. Did the authors look at the performance as a function of MET or the jet pT? It will be good to add that.
4. In Sec 4.2, the significance is given as Q = S times sqrt(B), surely a division is missed?
5. Did the authors think of making the procedure less dependent on rinv just apart from training on a mixed sample?
6. Minor text changes: QCD background -> multijet background, regular hadrons (in abstract) reads weird, final state will include SVJ (3rd paragraph in Sec 1)-> may include.
7. Also it will be useful to include a statement on the dependence on Pythia HV and lack of other HV modules.

  • validity: good
  • significance: good
  • originality: high
  • clarity: good
  • formatting: excellent
  • grammar: excellent

Author:  Kevin Pedro  on 2023-07-18  [id 3820]

(in reply to Report 2 on 2023-07-12)

We thank the referee for their thoughtful review of our paper. Our responses to the revisions requested by the referee can be found below. The updated version of the manuscript is available at https://arxiv.org/abs/2303.16253v3 and has been resubmitted to SciPost Phys.

  1. The ATLAS search was not published when the first draft of this paper was written. We have now added a citation in Section 1, paragraph 5: “A recent search from the ATLAS experiment sets limits on certain models of non-resonant t-channel SVJ production via Φ~\cite{ATLAS:2023swa}, but does not target resonant production.”

Other uses of ML for SVJs and studies of SVJ characteristics pertain to jet tagging and substructure. Those topics are not relevant to this work, which focuses on event-level mass reconstruction. We did not cite either our own or others’ work on such topics.

  1. We elaborated on this point in Section 1, paragraph 6: “In particular, Ref.~\cite{Kim:2021pcz} shows that for a fully visible final state where the target parameter is the theoretical mediator mass, the learned function is equivalent to the invariant mass calculation, and therefore we expect the EVN to produce optimal estimators in semivisible final states, as well.”

  2. Resonant searches typically are not binned in MET or jet pT; indeed, as high-mass resonance signatures include multiple jets, the latter quantity may be ambiguous. A basic discussion of the performance of conventional mass variables such as MT can be found in the literature. The high correlation between the learned variable and the conventional variables (shown in e.g. Fig. 4 for Z′) entails that the different variables must have similar performance. Therefore, basic physical principles will be followed in all cases, such as better resolution when jets have higher pT.

  3. Fixed.

  4. Our results show that even training on a mixed sample is unnecessary; the semisupervised nature of the training procedure automatically eliminates dependence on rinv (to the extent possible). Therefore, further investigation of more complicated decorrelation techniques was also deemed to be unnecessary.

  5. Changed to “QCD multijet background”, “standard model hadrons”, and “may include”.

  6. While the referee’s point about model dependence is correct in general, this study is actually less sensitive than most to effects from dark sector parameter variations. Those variations largely influence the showering and hadronization in the dark sector, with potential impact on the flavors and kinematic behavior of SM hadrons in the final state. However, this study does not consider jet substructure or other such variables, only high-level jet four-vectors and the ptmiss two-vector. The mediator mass and rinv are the only parameters that significantly affect those quantities, and the variations of those parameters are considered in great detail. A remark was already made about this in Section 2, paragraph 1: “The dark hadron mass scale does not significantly impact the event-level kinematic quantities that are used in this paper, so we do not consider variations of it or the parameters with related values.”

We added some clarification about this point to Section 2, paragraph 2: “The HV module simulates the dark sector dynamics using the Lund string model~\cite{Andersson:1983ia,Sjostrand:1984ic}; we use the default settings for the empirical parameters in this model, which are the values tuned for SM QCD. Different parameter values, or even a different dynamical model or generator software, could change the dark sector dynamics. However, such variations generally impact the formation and substructure of jets, rather than their final four-momenta, and therefore are not expected to have a significant impact on the event-level mass reconstruction pursued here.”

Report #1 by Anonymous (Referee 1) on 2023-7-6 (Invited Report)

  • Cite as: Anonymous, Report on arXiv:2303.16253v2, delivered 2023-07-05, doi: 10.21468/SciPost.Report.7452

Strengths

1. This paper studies the interesting question of improving collider searches for semi-visible jets. This is a motivated model with distinct collider signatures for which dedicated and novel searches can significantly improve the potential for discovery.

2. The paper presents a novel approach using ML tools to searching for semi-visible jets and conclusively shows that this approach is superior to the current state of the art.

3. This paper explores the applicability of this method to several signal models and demonstrates that their method is applicable and effective for all of them.

4. This paper explores the possibility of searching for signals outside the parameter space of the training data, and shows that their method works very well in that case.

5. The paper is clearly and logically written.

Report

As noted above, the paper concerns an important and interesting subject, and quantitatively shows that their proposed method is an improvement over the state of the art. I believe, however, that some revisions are required before the paper can be published.

1. The main results plots for two of the production models, figs 6, 15, show the improvement relative to state of the art methods in the third panel. These distributions, particularly in fig 6, show some values below 1 and some values above 1. I request that the authors estimate how much improvement this method will give in reach and/or sensitivity of a search to a particular model.

2. In figure 4, why is the fit done to a line with no offset? Namely, why do they use y=mx instead y = mx+b. Also in figure 4, the middle column seems to have a positive correlation between the variables, so why is m < 0?

3. The authors say that the R_T cut is important for reducing background. As such, the authors should explain in more detail how their cut differs from the experimental papers they cite, and how exactly they expect their different cut can compensate for a lack of detector simulation.

4. The authors should explain in detail what chi is in the Feynman diagrams of fig 1. I believe it is meant to be a dark quark which then showers and hadronizes into a jet of dark hadrons, but that should be in the text.

5. There is a mistake in definition of sensitivity at the beginning of section 4.2, there is a slash missing.

6. I'm not sure I understand the point of figure 2. The authors are not looking at signals with different numbers of phi's in the same signal region. Furthermore, the ratio of cross section depends very sensitively on the Yukawa couplings, so I'm not sure this plot is very meaningful for one value.

  • validity: good
  • significance: good
  • originality: good
  • clarity: good
  • formatting: excellent
  • grammar: excellent

Author:  Kevin Pedro  on 2023-07-18  [id 3819]

(in reply to Report 1 on 2023-07-06)

We thank the referee for their thoughtful review of our paper. Our responses to the revisions requested by the referee can be found below. The updated version of the manuscript is available at https://arxiv.org/abs/2303.16253v3 and has been resubmitted to SciPost Phys.

  1. We computed an estimate of the overall significance for each case and added the following descriptions to the paper: At the end of Section 4.2: “The overall significance for each signal model and mass variable can be approximated by adding the significance of each bin in quadrature. This calculation shows that the artificial variable has a 3--5% higher significance than MT, while MAOS has only a ~1% higher significance.” At the end of Section 5.2: “Computing the approximate overall significance shows that the artificial variable improves on MT2 by 17% for mΦ = 1000 GeV and 81% for mΦ = 2000 GeV.”

  2. We find that the offset b in the y=mx+b fit is very small and can be neglected. The negative sign was a mistake, which has been fixed. (In some cases, the artificial variable can have a negative correlation with the theory variable. In these cases, we multiply the artificial variable by -1, which is why all the plots show a positive correlation. We had forgotten to remove the minus signs from the values of m displayed in the plots.)

  3. We added more detail to the explanation of the RT cut: “Therefore, we apply a selection RT > 0.1, which is looser than the experimental thresholds of 0.15--0.25. The lower ptmiss values in the generator-level background imply that fewer events have high RT values, so this looser selection provides a level of background rejection similar to the realistic analysis.”

  4. We added the definition of the symbol χ where dark quarks are first mentioned in the text (Section 1, paragraph 2) and elaborated in the caption of Fig. 1: “Representative Feynman diagrams for leading-order production of a Z′ boson decaying to dark quarks χ (left), a single Φ boson associated with a dark quark and decaying to an SM quark and a dark quark (center), and a pair of Φ bosons each decaying to a dark quark and an SM quark (right).”

  5. Fixed.

  6. We included Fig. 2 to put each Φ production mode in context. While the optimal signal regions for each production mode are different, their absolute and relative cross sections illustrate their expected abundance in collider production. A realistic search strategy would likely consider all possible production modes, so it is useful to understand how many events would be expected in each case. In addition, the collider phenomenology of these different production modes was not thoroughly studied until this paper, and so it is our view that it is useful to introduce some basic properties of these signal models. In particular, these plots complement and extend Fig. 10 (left) from 1707.05326. While it is true that the cross sections are dependent on the Yukawa couplings, our choice of coupling value is clearly stated in Section 2.

Login to report or comment