SciPost logo

SciPost Submission Page

SPANet: Generalized Permutationless Set Assignment for Particle Physics using Symmetry Preserving Attention

by Alexander Shmakov, Michael James Fenton, Ta-Wei Ho, Shih-Chieh Hsu, Daniel Whiteson, Pierre Baldi

This Submission thread is now published as

Submission summary

Authors (as registered SciPost users): Michael James Fenton
Submission information
Preprint Link: https://arxiv.org/abs/2106.03898v3  (pdf)
Code repository: https://github.com/Alexanders101/SPANet
Data repository: http://mlphysics.ics.uci.edu/data/2021_ttbar/
Date accepted: 2022-02-22
Date submitted: 2021-12-24 21:44
Submitted by: Fenton, Michael James
Submitted to: SciPost Physics
Ontological classification
Academic field: Physics
Specialties:
  • High-Energy Physics - Experiment
  • High-Energy Physics - Phenomenology
Approaches: Experimental, Computational

Abstract

The creation of unstable heavy particles at the Large Hadron Collider is the most direct way to address some of the deepest open questions in physics. Collisions typically produce variable-size sets of observed particles which have inherent ambiguities complicating the assignment of observed particles to the decay products of the heavy particles. Current strategies for tackling these challenges in the physics community ignore the physical symmetries of the decay products and consider all possible assignment permutations and do not scale to complex configurations. Attention based deep learning methods for sequence modelling have achieved state-of-the-art performance in natural language processing, but they lack built-in mechanisms to deal with the unique symmetries found in physical set-assignment problems. We introduce a novel method for constructing symmetry-preserving attention networks which reflect the problem's natural invariances to efficiently find assignments without evaluating all permutations. This general approach is applicable to arbitrarily complex configurations and significantly outperforms current methods, improving reconstruction efficiency between 19\% - 35\% on typical benchmark problems while decreasing inference time by two to five orders of magnitude on the most complex events, making many important and previously intractable cases tractable. A full code repository containing a general library, the specific configuration used, and a complete dataset release, are avaiable at https://github.com/Alexanders101/SPANet

Author comments upon resubmission

Resubmission to address minor editorial comments from reviewer

List of changes

Added short discussion of MEM from CMS ttH analysis

Published as SciPost Phys. 12, 178 (2022)


Reports on this Submission

Anonymous Report 1 on 2022-1-4 (Invited Report)

Report

I thank the authors for the additional clarifications and changes to the manuscript and support publication in the current form.

  • validity: -
  • significance: -
  • originality: -
  • clarity: -
  • formatting: -
  • grammar: -

Login to report or comment