SciPost logo

SciPost Submission Page

High-dimensional and Permutation Invariant Anomaly Detection

by Vinicius Mikuni, Benjamin Nachman

This is not the latest submitted version.

This Submission thread is now published as

Submission summary

Authors (as registered SciPost users): Vinicius Mikuni
Submission information
Preprint Link: https://arxiv.org/abs/2306.03933v3  (pdf)
Code repository: https://github.com/ViniciusMikuni/PermutationInvariantAD
Data repository: https://zenodo.org/record/2603256
Date submitted: 2023-09-07 02:15
Submitted by: Mikuni, Vinicius
Submitted to: SciPost Physics
Ontological classification
Academic field: Physics
Specialties:
  • High-Energy Physics - Experiment
  • High-Energy Physics - Phenomenology
Approaches: Experimental, Computational, Phenomenological

Abstract

Methods for anomaly detection of new physics processes are often limited to low-dimensional spaces due to the difficulty of learning high-dimensional probability densities. Particularly at the constituent level, incorporating desirable properties such as permutation invariance and variable-length inputs becomes difficult within popular density estimation methods. In this work, we introduce a permutation-invariant density estimator for particle physics data based on diffusion models, specifically designed to handle variable-length inputs. We demonstrate the efficacy of our methodology by utilizing the learned density as a permutation-invariant anomaly detection score, effectively identifying jets with low likelihood under the background-only hypothesis. To validate our density estimation method, we investigate the ratio of learned densities and compare to those obtained by a supervised classification algorithm.

Current status:
Has been resubmitted

Reports on this Submission

Report #2 by Tilman Plehn (Referee 1) on 2023-11-22 (Invited Report)

  • Cite as: Tilman Plehn, Report on arXiv:2306.03933v3, delivered 2023-11-22, doi: 10.21468/SciPost.Report.8173

Report

The paper presents a seriously interesting application of modern neural networks to a key task at the LHC. It is novel, relevant, and well written. Sorry for being late with my report. I only have a few questions/remarks for the authors to consider, largely voluntarily:

- At the top of p.2 you say that there are no applications of diffusion models for density estimation. I view generative diffusion models over phase space as density estimation, but I could be easily convinced to agree with a more specific statement...
- As I have said for other papers, any chance you could include some kind of graphic representation of your network setup? So people can use it during talks?
- Why do you modify our two dark-jet datasets? I know it is extra work, but I think it would be very useful to show results for our `Aachen' and `Heidelberg' datasets. Any chance you could add that, to see what your network does when challenged more seriously?
- I am sorry, but I do not understand the argument before Eq.(7). The number of constituents is also one of the most useful observables to find semi-visible jets. What is it exactly that you want to achieve with this anomaly score in relation to N? And N = N_part? Sorry for not getting your point.
- Could you maybe add some more information to Tab.1, for instance on the inverse, Top background vs QCD signal performance? As far as I can see this table is the only way to compare your results to, for instance, Fig.4 in `QCD or What' or Fig.4 in the NAE paper.
- Similarly, any change you could show the inverse QCD signal performance in Fig.5? Do the ROC curves look the same?
- I am a little lost in your conclusions. In the beginning, your paper seems to be about the unsupervised density estimation, but the conclusion then ends with the positive note on the density ratio?

  • validity: -
  • significance: -
  • originality: -
  • clarity: -
  • formatting: -
  • grammar: -

Report #1 by Anonymous (Referee 2) on 2023-11-6 (Invited Report)

  • Cite as: Anonymous, Report on arXiv:2306.03933v3, delivered 2023-11-06, doi: 10.21468/SciPost.Report.8056

Strengths

1 -The paper details a novel computational method for using the learned density as a permutation-invariant anomaly detection score to improve anomaly detection. The authors show that the method is effective at dealing with higher dimensional parameter spaces.

2-It is written in a clear and intelligible way, it contains a good description of the problem and objectively evaluates the method against other methods

3-Code is provided to enable reproducibility. It contains a clear conclusion summarizing the results perspectives for future work.

Weaknesses

None

Report

The paper is a well written comprehensive study and I find it to be acceptable in its present form.

Requested changes

None

  • validity: top
  • significance: good
  • originality: good
  • clarity: top
  • formatting: perfect
  • grammar: perfect

Login to report or comment