SciPost Submission Page
Precision calibration of calorimeter signals in the ATLAS experiment using an uncertainty-aware neural network
by ATLAS Collaboration
This is not the latest submitted version.
Submission summary
| Submission information | |
|---|---|
| Authors (as registered SciPost users): | ATLAS Collaboration |
| Preprint Link: | https://arxiv.org/abs/2412.04370v1 (pdf) |
| Date submitted: | Dec. 16, 2024, 3:26 p.m. |
| Submitted by: | ATLAS Collaboration |
| Submitted to: | SciPost Physics |

| Ontological classification | |
|---|---|
| Academic field: | Physics |
| Specialties: | |
| Approaches: | Experimental, Computational |
Abstract
The ATLAS experiment at the Large Hadron Collider explores the use of modern neural networks for a multi-dimensional calibration of its calorimeter signal defined by clusters of topologically connected cells (topo-clusters). The Bayesian neural network (BNN) approach not only yields a continuous and smooth calibration function that improves performance relative to the standard calibration but also provides uncertainties on the calibrated energies for each topo-cluster. The results obtained by using a trained BNN are compared to the standard local hadronic calibration and to a calibration provided by training a deep neural network. The uncertainties predicted by the BNN are interpreted in the context of a fractional contribution to the systematic uncertainties of the trained calibration. They are also compared to uncertainty predictions obtained from an alternative estimator employing repulsive ensembles.
Author indications on fulfilling journal expectations
- Provide a novel and synergetic link between different research areas.
- Open a new pathway in an existing or a new research direction, with clear potential for multi-pronged follow-up work
- Detail a groundbreaking theoretical/experimental/computational discovery
- Present a breakthrough on a previously-identified and long-standing research stumbling block
Reports on this Submission
Report
The ATLAS Collaboration presents a detailed study of Bayesian neural networks (BNN) for the energy calibration of clusters in the ATLAS calorimeters at the LHC. The study is novel in that it is one of the first applications of BNNs under realistic conditions at a collider experiment. It is also an application in a highly relevant field of LHC physics, as energy clusters in calorimeters are used in almost every data analysis at the LHC. This work could hence very well open new pathways in the analysis of LHC data, as deep neural networks are frequently used for calibrations but the uncertainties associated with network predictions have not gained much attention so far. It is hence plausible that the presented work contributes to improving the precision at the ATLAS experiment and possibly other high-energy physics experiments.
The presented work contains a detailed study of the achievable mean energy regression and the associated energy resolution, in comparison to uncalibrated clusters (EM scale), to an ATLAS standard calibration, and to a previously published regression with a deep neural network (DNN). In addition, the uncertainty predictions of the BNN are studied in detail and are compared to the predictions from repulsive neural networks (RE), which provide an independent uncertainty measure. The BNN is shown to provide calibration improvements over the standard technique similar to (and sometimes even better than) those of the DNN. The uncertainties of the BNN and the RE are found to be consistent and conservative, which gives confidence in their robustness.
The paper is in general well written and contains interesting results that are worth publishing in this journal. My main criticism is two-fold: 1) The discussion of uncertainties is not consistent throughout the paper. As the main purpose of this study is the uncertainties, this should be improved. The main difficulty arises from the different nomenclature of uncertainties in data science and high-energy physics, and hence the interpretation of the BNN and RE uncertainties. Please refer to my comments below. 2) The difference in performance between the BNN and the DNN is not discussed in detail. In principle, the BNN should have the same performance as the DNN, unless there are differences in these approaches beyond the BNN’s estimate of the weight variances. One difference is the use of the 3 Gaussians for the loss term in the BNN, which is mentioned in the paper but not discussed in detail. Other differences could be due to the self-regularizing nature of BNNs or to differences in the networks’ input features, pre-processing, network widths, depths, etc. Also here, please refer to my comments below.
I recommend this paper to be published in SciPost Physics once my comments below are answered.
Uncertainties:
page 12, last paragraph of Section 3.2: “They provide measures of the ultimate accuracy achieved with the trained calibration network.” - This is a strong statement that would benefit from a more detailed discussion of what predictive uncertainties are, how they are defined and what they are intended to cover. I suggest introducing the relevant concepts much earlier and with more clarity rather than in Section 5 (see also below).
page 18, last paragraph of Section 3.3.2: “They potentially represent important contributions to the nuisances characterising the overall local systematic uncertainties.” - This statement is also potentially strong. It is important for the reader to know what is understood by “local systematic uncertainties” in this context, and this would again benefit from more clarity in the discussion of what BNN uncertainties are intended to cover, a discussion that is only introduced very late in Section 5.
Figure 2: introduces statistical and systematic uncertainties without further context of whether these correspond to the HEP understanding of statistical and systematic uncertainties. Again, information from Section 5 is necessary to understand this.
Section 5: Only this section introduces the concepts of epistemic and aleatoric uncertainties. As discussed in the three points above, it is necessary to discuss this earlier in the paper, so that no confusion arises for readers who are not already familiar with BNNs. It is necessary to be very clear in how epistemic and aleatoric uncertainties are linked to statistical and systematic uncertainties. I find the discussion in Section 5 not very clear. First it is indicated in Section 5.1 that epistemic uncertainties may be understood as statistical uncertainties and aleatoric uncertainties as systematic uncertainties. At the end of Section 5.1, it is mentioned that both (epistemic and aleatoric uncertainties) give rise to nuisance parameters (= systematic uncertainties in the HEP physicists’ understanding). And then, in Section 5.2, it is discussed that both systematic and statistical uncertainties have epistemic and aleatoric “components”, without specifying further how large these components could be or whether these components are even well defined. In addition to making the discussion clearer here, it is important to make the link to HEP physicists’ understanding of statistical and systematic uncertainties in calibrations or measurements.
Comparison of BNN to DNN:
Section 3.3.2: Do you have evidence that 3 Gaussians are appropriate for approximating the likelihood? How do you choose this number? How do the results differ if you change it?
Figures 6, 7, 9: What is the origin of the differences in performance between BNN and DNN? Wouldn’t one expect that the mean predictions of the BNN reproduce those of the DNN if they use the same inputs and are both expressive enough for this regression task? How does the depth and width of the BNN and DNN compare? How is the DNN regularized compared to the BNN (which typically does not require regularization)? Or is this coming from the Gaussian mixture model? If it is the Gaussian mixture model, you may be able to show this by reducing the likelihood to a single Gaussian in the BNN.
Other clarifications and questions:
page 11, last paragraph: I suggest clarifying in Eq. (3) which of these inputs are “all of the topo-cluster observables employed by the classification and the hadronic calibration in the LCW sequence”. I assume that these are those from zeta_clus^EM to f_emc, but it would be helpful to state this explicitly, so that the reader does not feel the need to look it up in the literature referenced earlier.
page 14, discussion of R^EM_clus: I am missing a discussion in terms of physics of the peak at ~50 in Figure 1c) in the EM and mixed components. Why do the effects that reduce the true deposited energy (denominator) not also reduce the measured energy at the EM scale (numerator)? Why is there a peak in particular at 50? Is this related to selection cuts in the measured energy (numerator) and if yes: how?
Table 3: Why do you use a batch size for inference? In general, I would expect inference on the full validation/test dataset.
page 16, footnote: The footnote claims that there is no “observable effects on the accuracy of the predictions” when using other functions than Gaussians for the representation of the distribution over the network weights. The statement reads as if this was tested in the context of this work, while I would have expected this to be a general statement about BNNs. If it is the former, please give more information. If it is the latter, please give proof with a reference.
page 21: Why do you say “|Delta_E^EM|>=Delta^kappa_E “and not “-Delta_E^EM > |Delta^kappa_E|”? Delta_E^EM should be mostly negative, as the EM scale should be further away from the hadronic scale than the calibrated scales. However, the hadronic scale (as in Delta^kappa_E in the formula) can vary around the true scale and would need the magnitude sign.
page 21, footnote 11: Why does the EM scale correspond to the hadronic scale (i.e. E^dep_clus) at high energies?
page 21, last paragraph before Section 4.1.3: What is the difference between a kinematic bias and a kinematic shift?
Figure 5: Why do you cut the y-axis at ~7, while there are relevant features up to ~100? Is the second peak around 50 (Fig. 1c) reproduced by the BNN and the RE?
page 23, first paragraph: I did not understand the explanation for the “upper ridge”. What does the following sentence mean? “The upper ridge populated by topo-clusters with rising REM for decreasing femc is likely introduced by E^EM_clus at EM scale that overestimates energy deposits more and more dominated by ionisations in hadronic showers extending into the Tile calorimeter.” Can you please rephrase for clarity?
Section 4.2.2: Please clarify how the choice of the loss function is connected to the fact that the mean response is described worse than the median response? Isn’t the loss function based on means rather than medians?
page 24, last paragraph: Which input features are used to tag the resistant topo-clusters? Please provide more detail to the readers.
Figure 13: What does an uncertainty of ~10 mean? An uncertainty of 10 on R or on log(R)? (Also related to the next question.)
page 33, Eq. (20): I did not understand this formula (and hence Figure 12). If the target is log10(R), why isn’t the pull then just (log10R_prediction - log10R_target)/sigma_prediction. Why do you multiply by R^BNN_clus? Does sigma_prediction represent the variation in log10R (the training target) or in R itself? I may be missing something here that may need clarification in the text of the paper.
page 35: Do all clusters in the gap region show these large variations or only a small fraction? If it is only a small fraction, can you point to what causes those few topo-clusters to be badly regressed? (This is also related to the next question.)
page 35, last paragraph: “appropriate and traceable total uncertainty” - What is meant by “appropriate” and “traceable” and how do you come to this conclusion? Did you check that an uncertainty of 10 is indeed the correct value? (For example by a pull plot for these badly estimated topo-clusters?)
page 37, last sentence of the conclusions: I did not follow the argument why high-level variables are better than low-level variables for the retraining of a neural network. Please consider rephrasing for more clarity.
Typos and minor suggestions:
page 10, numbered list, item 3: “it not all” -> “if not all”
page 19, last paragraph: I suggest to rephrase that the median “is better defined than the statistical mean”. The choice of median/mode/arithmetic mean/… is only a choice. You argue why use the median here, which is fine, but I would refrain from a statement saying that one or another choice is “better defined”. Please consider rephrasing.
page 29: “the use of a Gaussian mixture model”
page 34, first line: I suggest to rephrase “confirms that” to “is consistent with the observation that”.
Recommendation
Ask for minor revision
Strengths
1- First application of BNNs to calorimeter calibration in ATLAS.
2- A key advantage over standard deep learning-based calibrations.
3- Performance comparisons with existing methods (LCW and DNN) are well-structured.
4- Logical structure and good readability.
Weaknesses
1- The work does not yet demonstrate performance on real experimental data, making systematic uncertainties related to simulation inaccuracies a concern.
2- The discussion on Bayesian inference and loss function derivation is somewhat lengthy and should be streamlined.
3- It is not clear if the trained models and inference scripts will be provided for independent verification.
Report
The study is well motivated and relevant for calorimeter-based measurements with the ATLAS experiment. The application of uncertainty-aware machine learning techniques in energy calibration represents an important step forward. Its importance, however, has not been demonstrated.
The methodology provides a foundation for future applications of uncertainty-aware deep learning in detector calibrations beyond calorimetry, potentially extending to other domains such as jet reconstruction.
The paper is well structured, with minimal jargon and a clear motivation. The methodology is described in sufficient detail, although some explanations of Bayesian techniques could be made more concise. The methodology, including dataset composition, feature selection, training procedures, and evaluation metrics, is clearly documented. The paper also references an external repository for further details on implementation.
It meets the journal acceptance criteria.
Requested changes
1- The discussion on Bayesian inference and loss function derivation is somewhat lengthy and should be streamlined.
2- "They potentially represent important contributions to the nuisances characterising the overall local systematic uncertainties." This statement deserves a justification, i.e. a test which shows that this is indeed the case.
Recommendation
Publish (meets expectations and criteria for this Journal)

Author: ATLAS Collaboration on 2025-05-19 [id 5496]
(in reply to Report 2 on 2025-03-13)

SciPost referee report #1: Report #1 by Anonymous (Referee 1) on 2025-3-7 (Invited Report)
Strengths
1- First application of BNNs to calorimeter calibration in ATLAS.
2- A key advantage over standard deep learning-based calibrations.
3- Performance comparisons with existing methods (LCW and DNN) are well-structured.
4- Logical structure and good readability.
Weaknesses
1- The work does not yet demonstrate performance on real experimental data, making systematic uncertainties related to simulation inaccuracies a concern.
— This paper is intentionally limited to a proof-of-principle study, and as such is considered the foundation of the performance studies that are now underway in ATLAS for all final states in both MC simulations and data.
2- The discussion on Bayesian inference and loss function derivation is somewhat lengthy and should be streamlined.
— This paper tries to address a still somewhat diverse audience. We believe it will find interested readers both from the detector (calorimeter) and performance domain and in the machine-learning community. We tried to serve both communities in the sense that not too many external sources would have to be considered to understand the findings reported here. We understand that we may not have found the optimal level here, but we would really like to keep this as is.
3- It is not clear if the trained models and inference scripts will be provided for independent verification.
— We will follow the ATLAS policy on open data on this issue. We are working on making all relevant software and data available to the general public.
Report
This paper presents an application of Bayesian Neural Networks (BNNs) for the multi-dimensional calibration of calorimeter signals in the ATLAS experiment. The proposed method provides a continuous and smooth calibration function and estimates uncertainties associated with the calibrated energies. The results are compared with standard local hadronic calibration (LCW) and a deep neural network (DNN)-based approach. The BNN-calibrated energy is shown to improve upon previous techniques, with an additional advantage of providing well-characterized uncertainties.
The study is well motivated and relevant for calorimeter-based measurements with the ATLAS experiment. The application of uncertainty-aware machine learning techniques in energy calibration represents an important step forward. Its importance, however, has not been demonstrated.
The methodology provides a foundation for future applications of uncertainty-aware deep learning in detector calibrations beyond calorimetry, potentially extending to other domains such as jet reconstruction.
The paper is well structured, with minimal jargon and a clear motivation. The methodology is described in sufficient detail, although some explanations of Bayesian techniques could be made more concise. The methodology, including dataset composition, feature selection, training procedures, and evaluation metrics, is clearly documented. The paper also references an external repository for further details on implementation.
It meets the journal acceptance criteria.
Requested changes
1- The discussion on Bayesian inference and loss function derivation is somewhat lengthy and should be streamlined.
— See initial response above. In addition to what is said there, the present text is also the result of an in-depth editorial process involving both detector and ML experts.
2- "They potentially represent important contributions to the nuisances characterising the overall local systematic uncertainties.". This statement deserves a justification, i.e. a test which shows that this is indeed the case.
— We believe this is a very general statement. Whether or not this is the case depends on how the calibrated topo-clusters enter the final-state reconstruction. If they contribute to a jet, the cluster uncertainties may not enter at all, because the jet itself has a better-constrained calibration that is not fed back to its constituents. Yet, if the topo-cluster is part of the (soft) hadronic recoil, outside of a jet or reconstructed-particle context, its calibration uncertainty may well be a component contributing to the nuisances of the recoil reconstruction. We changed the text here to soften the statement and added the other important use case (topo-cluster selection based on calibration accuracy prior to particle, jet and recoil reconstruction).
— Thank you very much for your helpful review. We hope you find our responses satisfactory.
Recommendation
Publish (meets expectations and criteria for this Journal)
validity: good | significance: good | originality: good | clarity: high | formatting: good | grammar: excellent
SciPost referee report #2:
Report #2 by Anonymous (Referee 1) on 2025-3-13 (Invited Report)
Report
The ATLAS Collaboration presents a detailed study of Bayesian neural networks (BNN) for the energy calibration of clusters in the ATLAS calorimeters at the LHC. The study is novel in that it is one of the first applications of BNNs under realistic conditions at a collider experiment. It is also an application in a highly relevant field of LHC physics, as energy clusters in calorimeters are used in almost every data analysis at the LHC. This work could hence very well open new pathways in the analysis of LHC data, as deep neural networks are frequently used for calibrations but the uncertainties associated with network predictions have not gained much attention so far. It is hence plausible that the presented work contributes to improving the precision at the ATLAS experiment and possibly other high-energy physics experiments.
The presented work contains a detailed study of the achievable mean energy regression and the associated energy resolution, in comparison to uncalibrated clusters (EM scale), to an ATLAS standard calibration, and to a previously published regression with a deep neural network (DNN). In addition, the uncertainty predictions of the BNN are studied in detail and are compared to the predictions from repulsive neural networks (RE), which provide an independent uncertainty measure. The BNN is shown to provide calibration improvements over the standard technique similar to (and sometimes even better than) those of the DNN. The uncertainties of the BNN and the RE are found to be consistent and conservative, which gives confidence in their robustness.
The paper is in general well written and contains interesting results that are worth publishing in this journal. My main criticism is two-fold: 1) The discussion of uncertainties is not consistent throughout the paper. As the main purpose of this study is the uncertainties, this should be improved. The main difficulty arises from the different nomenclature of uncertainties in data science and high-energy physics, and hence the interpretation of the BNN and RE uncertainties. Please refer to my comments below. 2) The difference in performance between the BNN and the DNN is not discussed in detail. In principle, the BNN should have the same performance as the DNN, unless there are differences in these approaches beyond the BNN’s estimate of the weight variances. One difference is the use of the 3 Gaussians for the loss term in the BNN, which is mentioned in the paper but not discussed in detail. Other differences could be due to the self-regularizing nature of BNNs or to differences in the networks’ input features, pre-processing, network widths, depths, etc. Also here, please refer to my comments below.
I recommend this paper to be published in SciPost Physics once my comments below are answered.
Uncertainties:
page 12, last paragraph of Section 3.2: “They provide measures of the ultimate accuracy achieved with the trained calibration network.” - This is a strong statement that would benefit from a more detailed discussion of what predictive uncertainties are, how they are defined and what they are intended to cover. I suggest introducing the relevant concepts much earlier and with more clarity rather than in Section 5 (see also below).
— We would like to keep the detailed discussion of the uncertainties in Section 5.1, close to the presentation of the corresponding findings. We added a reference to this section and rephrased the sentence a bit.
page 18, last paragraph of Section 3.3.2: “They potentially represent important contributions to the nuisances characterising the overall local systematic uncertainties.” - This statement is also potentially strong. It is important for the reader to know what is understood by “local systematic uncertainties” in this context, and this would again benefit from more clarity in the discussion of what BNN uncertainties are intended to cover, a discussion that is only introduced very late in Section 5.
— In the spirit of our response to the previous comment, we would like to keep the detailed uncertainty discussion in Section 5. In our opinion, the statement about the contribution of nuisances to total uncertainties is rather general, but we rephrased it to include the aspect that this contribution depends on the calibration model and context, in particular when applying the MC-derived calibration to experimental data.
Figure 2: introduces statistical and systematic uncertainties without further context of whether these correspond to the HEP understanding of statistical and systematic uncertainties. Again, information from Section 5 is necessary to understand this.
— We added a reference to the discussion in Section 5 to the captions of Figure 2 (and Figure 3). We believe that having the detailed discussion close to the presentation of the corresponding findings is an appropriate way to present those.
Section 5: Only this section introduces the concepts of epistemic and aleatoric uncertainties. As discussed in the three points above, it is necessary to discuss this earlier in the paper, so that no confusion arises for readers who are not already familiar with BNNs. It is necessary to be very clear in how epistemic and aleatoric uncertainties are linked to statistical and systematic uncertainties. I find the discussion in Section 5 not very clear. First it is indicated in Section 5.1 that epistemic uncertainties may be understood as statistical uncertainties and aleatoric uncertainties as systematic uncertainties. At the end of Section 5.1, it is mentioned that both (epistemic and aleatoric uncertainties) give rise to nuisance parameters (= systematic uncertainties in the HEP physicists’ understanding). And then, in Section 5.2, it is discussed that both systematic and statistical uncertainties have epistemic and aleatoric “components”, without specifying further how large these components could be or whether these components are even well defined. In addition to making the discussion clearer here, it is important to make the link to HEP physicists’ understanding of statistical and systematic uncertainties in calibrations or measurements.
— The principal definition of systematic and statistical uncertainties in the context of the study presented in this paper is already given in the introduction. We added two more sentences there to make this more obvious, including a direct reference to “standard” sample-based calibration methods like the one for jets in ATLAS. As the concepts of aleatoric and epistemic uncertainties are highly descriptive and well understood in terms of categorisation in the ML community, they are not too relevant for the application of this calibration in the HEP context. We tried to link these ML uncertainties with the experimental uncertainties in a qualitative manner in Section 5. For this, we added a small table making the relation between these two categories clearer, and changed the text accordingly. As said before, we would like to keep the discussion in Section 5, though.
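For illustration, one common way to make the epistemic/aleatoric split concrete is the generic decomposition of a BNN's total predictive variance shown below. This is a textbook relation and not necessarily the exact definition adopted in the paper:

```latex
% Generic decomposition of the predictive variance of a BNN for input x,
% with network weights \theta sampled from the (approximate) posterior:
\sigma^2_{\mathrm{tot}}(x) \;=\;
  \underbrace{\operatorname{Var}_{\theta}\!\big[\mu_{\theta}(x)\big]}_{\text{epistemic (model/statistical)}}
  \;+\;
  \underbrace{\operatorname{E}_{\theta}\!\big[\sigma^2_{\theta}(x)\big]}_{\text{aleatoric (data/noise)}}
```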
Comparison of BNN to DNN:
Section 3.3.2: Do you have evidence that 3 Gaussians are appropriate for approximating the likelihood? How do you choose this number? How do the results differ if you change it?
— The number of Gaussians used for the studies in the paper (3) is the result of studies with 1 to 9 Gaussians. We found that N_mix >= 3 yields the same results on the metrics we use. We feel this does not need more discussion; we will just add one sentence.
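To make the mixture-likelihood discussion more tangible, the following is a minimal PyTorch-style sketch of a negative log-likelihood loss for an N_mix-component Gaussian mixture. The output layout (weights, means, log-widths) is an assumption for illustration and does not reproduce the exact ATLAS implementation:

```python
import math
import torch

def gmm_nll(pred, target):
    """Negative log-likelihood of an N_mix-component Gaussian mixture.

    pred:   (batch, 3*N_mix) tensor holding per-event mixture logits,
            means and log-widths (this layout is an assumption).
    target: (batch,) tensor with the regression target, e.g. log10(R).
    """
    n_mix = pred.shape[1] // 3
    logits, mu, log_sigma = pred[:, :n_mix], pred[:, n_mix:2 * n_mix], pred[:, 2 * n_mix:]
    log_w = torch.log_softmax(logits, dim=1)        # mixture weights, sum to 1
    sigma = torch.exp(log_sigma)
    # per-component Gaussian log-density of the target
    log_gauss = (-0.5 * ((target[:, None] - mu) / sigma) ** 2
                 - log_sigma - 0.5 * math.log(2.0 * math.pi))
    # log-sum-exp over the components, averaged over the batch
    return -torch.logsumexp(log_w + log_gauss, dim=1).mean()
```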
Figures 6, 7, 9: What is the origin of the differences in performance between BNN and DNN? Wouldn’t one expect that the mean predictions of the BNN reproduce those of the DNN if they use the same inputs and are both expressive enough for this regression task? How does the depth and width of the BNN and DNN compare? How is the DNN regularized compared to the BNN (which typically does not require regularization)? Or is this coming from the Gaussian mixture model? If it is the Gaussian mixture model, you may be able to show this by reducing the likelihood to a single Gaussian in the BNN.
— The DNN is discussed in detail in the referred ATLAS PubNote. It is regularized using a leaky Gaussian kernel (see https://cds.cern.ch/record/2866591/files/ATL-PHYS-PUB-2023-019.pdf for the details) in a first round of training, which is followed by a second round without regularization that starts from the weights fitted in the first. The focus of this paper is indeed the predictive uncertainties, not a detailed comparison with previous ML-based calibration attempts. We understand and share the curiosity about the reason for the differences, but it is also non-trivial to directly project these back to network (model) designs, loss functions and hyper-parameter settings. We believe the main reasons why the BNN improves in nearly, but not strictly, all cases are (1) the loss-function definition, which accommodates asymmetric shapes of the probability density, and (2) the numerically less restrictive choice of activation functions, which supports all predictions >= 0, whereas the DNN's (last-layer) activation function only produces predictions in ]0,4].
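As an illustration of the difference in output ranges mentioned above, the two last-layer activations below map to (0, inf) and to ]0,4], respectively. The specific functions shown (softplus, scaled sigmoid) are illustrative assumptions, not necessarily the ones used in the two networks:

```python
import torch

# Illustrative last-layer activations (assumed examples, not the exact ones used in the paper):
softplus = torch.nn.Softplus()                 # maps to (0, inf): any prediction >= 0 is reachable
bounded = lambda z: 4.0 * torch.sigmoid(z)     # maps to (0, 4): predictions restricted to ]0,4]

z = torch.linspace(-5.0, 5.0, 5)
print(softplus(z))   # unbounded above
print(bounded(z))    # saturates at 4
```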
Other clarifications and questions:
page 11, last paragraph: I suggest clarifying in Eq. (3) which of these inputs are “all of the topo-cluster observables employed by the classification and the hadronic calibration in the LCW sequence”. I assume that these are those from zeta_clus^EM to f_emc, but it would be helpful to state this explicitly, so that the reader does not feel the need to look it up in the literature referenced earlier.
— We added a column to Table 1 indicating the features used by LCW.
page 14, discussion of R^EM_clus: I am missing a discussion in terms of physics of the peak at ~50 in Figure 1c) in the EM and mixed components. Why do the effects that reduce the true deposited energy (denominator) not also reduce the measured energy at the EM scale (numerator)? Why is there a peak in particular at 50? Is this related to selection cuts in the measured energy (numerator) and if yes: how?
— The very large responses are discussed in the text, but we added a bit more explanation. In general, the large responses can be associated with topo-clusters located in the Tile gap region (those are the same ones that have the large uncertainties). This region is not instrumented by a calorimeter; rather, it has scintillating counters that are supposed to generate signals proportional to the energy losses in the material around them. The energy deposited in such a counter (which is treated as a calorimeter cell without an absorber) is relatively small. The EM-scale calibration that enters R^EM_clus (the target) was set to correspond to some expectation for an average energy loss in the surrounding materials, and thus the numerator reflects a much larger true (deposited) energy than is given in the denominator. The peak (on the log-R scale) arises from the non-sampling character and the rather linear relation between the TileGap signal and the energy deposited in it (both are nearly directly proportional; the signal is just about 50 times too large at EM scale), folded with the relatively small fluctuations in the ionisation energy loss. Similar arguments hold for topo-clusters in the pre-samplers in the LAr calorimeters. In addition, in-time pile-up signal contributions add to the signal but not to the truth (denominator). We changed the text along these lines.
Table 3: Why do you use a batch size for inference? In general, I would expect inference on the full validation/test dataset.
— This is introduced by computational limitations, in particular the limited available memory. We infer on the full test and validation datasets; we just do it in batches.
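A minimal sketch of what batched inference over the full dataset could look like is given below. It assumes a PyTorch-style `model` whose forward pass draws one fresh sample of the weight posterior per call, and a dataset yielding feature tensors; all names are hypothetical:

```python
import torch
from torch.utils.data import DataLoader

def predict_in_batches(model, dataset, batch_size=4096, n_weight_samples=100):
    """Run BNN inference on the full dataset in batches to limit memory use."""
    means, spreads = [], []
    loader = DataLoader(dataset, batch_size=batch_size, shuffle=False)
    with torch.no_grad():
        for features in loader:                       # assumes the dataset yields feature tensors
            # one forward pass per sampled weight configuration
            samples = torch.stack([model(features) for _ in range(n_weight_samples)])
            means.append(samples.mean(dim=0))         # predictive mean per topo-cluster
            spreads.append(samples.std(dim=0))        # spread induced by the weight posterior
    return torch.cat(means), torch.cat(spreads)
```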
page 16, footnote: The footnote claims that there is no “observable effects on the accuracy of the predictions” when using other functions than Gaussians for the representation of the distribution over the network weights. The statement reads as if this was tested in the context of this work, while I would have expected this to be a general statement about BNNs. If it is the former, please give more information. If it is the latter, please give proof with a reference.
— We added the reference.
page 21: Why do you say “|Delta_E^EM|>=Delta^kappa_E “and not “-Delta_E^EM > |Delta^kappa_E|”? Delta_E^EM should be mostly negative, as the EM scale should be further away from the hadronic scale than the calibrated scales. However, the hadronic scale (as in Delta^kappa_E in the formula) can vary around the true scale and would need the magnitude sign.
— Thanks for finding this oversight. We removed the equation and changed the text a bit to highlight that E^EM_clus is not a calibrated scale. In our definition in Eq. (16), the signal at EM scale is naively expected to be too small, but as Figures 7-8 show, due to pile-up this is not the case for the full phase space.
page 21, footnote 11: Why does the EM scale correspond to the hadronic scale (i.e. E^dep_clus) at high energies?
— The fraction of energy deposited by photons increases (slightly) for high topo-cluster energies (Figures 1(a),(b)), and the intrinsic EM component in hadronic showers increases with energy as well, e/pi gets closer to unity (how close depends on the models). Nevertheless, with the change of text due to the previous comment we removed this footnote.
page 21, last paragraph before Section 4.1.3: What is the difference between a kinematic bias and a kinematic shift?
— We removed the shifts.
Figure 5: Why do you cut the y-axis at ~7, while there are relevant features up to ~100? Is the second peak around 50 (Fig. 1c) reproduced by the BNN and the RE?
— These qualitative comparisons focus on the complex shape in the highly populated areas. Considering the log(z) scale, we decided to cut the y-axis as seen. The direct correlation between the prediction and the target is available in the auxiliary material and covers the full response range. Both BNN and RE actually learn the high responses, albeit with a large uncertainty. Please check https://atlas.web.cern.ch/Atlas/GROUPS/PHYSICS/PAPERS/JETM-2024-01/ , Figure 03a in the auxiliary material section.
page 23, first paragraph: I did not understand the explanation for the “upper ridge”. What does the following sentence mean? “The upper ridge populated by topo-clusters with rising REM for decreasing femc is likely introduced by E^EM_clus at EM scale that overestimates energy deposits more and more dominated by ionisations in hadronic showers extending into the Tile calorimeter.” Can you please rephrase for clarity?
— It is largely an artifact of e/mu < 1 in the Tile calorimeter. A decreasing f_emc indicates topo-clusters located more and more exclusively in the hadronic Tile calorimeter, which is sensitive enough that energy deposits from ionization losses by charged hadrons (behaving like muons) generate topo-clusters. The fact that these charged tracks leave a larger signal than electrons depositing the same energy is clearly visible in this upper ridge (response > 1). We rephrased this paragraph to emphasize this fact.
Section 4.2.2: Please clarify how the choice of the loss function is connected to the fact that the mean response is described worse than the median response? Isn’t the loss function based on means rather than medians?
— For stability and algorithmic simplicity, the median of our typical response distributions (in bins of any evaluation scale) is used for performance evaluations. The prediction is actually the mode of the Gaussian-mixture PDF (one learned for each topo-cluster). In our case, the mean may depend too much on the available statistics, due to the asymmetric PDFs, to provide a stable calibration target.
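As a simplified illustration of extracting the per-topo-cluster prediction from a learned mixture, the numpy sketch below finds the mode of a one-dimensional Gaussian mixture by a dense grid scan. The procedure actually used in the paper is not specified here and may differ:

```python
import numpy as np

def gmm_pdf(x, w, mu, sigma):
    """1D Gaussian-mixture density at points x; w, mu, sigma are per-component arrays."""
    x = np.atleast_1d(x)[:, None]
    return np.sum(w * np.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * np.sqrt(2.0 * np.pi)), axis=1)

def gmm_mode(w, mu, sigma, n_grid=10001):
    """Crude global-mode finder for a 1D mixture via a dense grid scan."""
    grid = np.linspace(mu.min() - 5 * sigma.max(), mu.max() + 5 * sigma.max(), n_grid)
    return grid[np.argmax(gmm_pdf(grid, w, mu, sigma))]

# toy example: asymmetric three-component mixture
w, mu, sigma = np.array([0.6, 0.3, 0.1]), np.array([0.0, 0.4, 1.2]), np.array([0.1, 0.3, 0.5])
print(gmm_mode(w, mu, sigma))   # close to 0.0, the location of the dominant narrow component
```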
page 24, last paragraph: Which input features are used to tag the resistant topo-clusters? Please provide more detail to the readers.
— This information is available in Table 01 of the auxiliary material at https://atlas.web.cern.ch/Atlas/GROUPS/PHYSICS/PAPERS/JETM-2024-01/, where more results on these topo-clusters can also be found.
Figure 13: What does an uncertainty of ~10 mean? An uncertainty of 10 on R or on log(R)? (Also related to the next question.)
— These are absolute uncertainties on R, to illustrate the effect better. The large uncertainties in the Tile Gap around 10 mean relative uncertainties of about 20% (10/50). If axis labels do not indicate otherwise, all predictions are in direct (non-transformed) observable space.
page 33, Eq. (20): I did not understand this formula (and hence Figure 12). If the target is log10(R), why isn’t the pull then just (log10R_prediction - log10R_target)/sigma_prediction. Why do you multiply by R^BNN_clus? Does sigma_prediction represent the variation in log10R (the training target) or in R itself? I may be missing something here that may need clarification in the text of the paper.
— The pull in log10(R) space is indeed given as you said, with sigma_prediction determined at this transformed scale as well. The original formula relates the log10(R) space response and uncertainty predictions to the corresponding ones in linear space, which is useful for calibration. We rephrased this and modified Equation 20 to highlight the relation between the linear and log pull.
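For reference, the standard error propagation connecting the log10-space quantities to the linear-space ones is written out below. We assume this is the relation encoded by the modified Eq. (20), so it should be read as an illustration rather than the paper's exact formula:

```latex
% If the network predicts m = \log_{10} R_{\mathrm{clus}} with uncertainty \sigma_m, then
R_{\mathrm{clus}} = 10^{m}, \qquad
\sigma_R \,\simeq\, \Big|\frac{\partial R_{\mathrm{clus}}}{\partial m}\Big|\,\sigma_m
         \,=\, \ln(10)\, R_{\mathrm{clus}}\, \sigma_m ,
% so that, to first order, the pull is the same in both spaces:
\frac{R^{\mathrm{pred}} - R^{\mathrm{true}}}{\sigma_R}
  \,\approx\, \frac{m^{\mathrm{pred}} - m^{\mathrm{true}}}{\sigma_m}.
```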
page 35: Do all clusters in the gap region show these large variations or only a small fraction? If it is only a small fraction, can you point to what causes those few topo-clusters to be badly regressed? (This is also related to the next question.)
— We can only conclude (for this study) that, with very few exceptions, all topo-clusters in the selected uncertainty range can be found in the Tile gap region (see Figure 14(b)). We believe the uncertainty is considerable because some features lack expressiveness in this region. The fact that the centres of mass of the affected topo-clusters are inside or very close to the thin scintillators indicates that cells in the active calorimeter volumes in front of and behind them do not contribute much to the topo-cluster energy, and thus to e.g. longitudinal (along the particle direction of flight) energy distributions and related features. Also, the Tile gap scintillators are large paddles with little granularity in rapidity (3 sections) and coarse granularity in azimuth (64/32 panels around). Features of showers starting around them, or of shower particles traversing them, are not too relevant for the signal in them.
page 35, last paragraph: “appropriate and traceable total uncertainty” - What is meant by “appropriate” and “traceable” and how do you come to this conclusion? Did you check that an uncertainty of 10 is indeed the correct value? (For example by a pull plot for these badly estimated topo-clusters?)
— Appropriate means large, while traceable means “traceable source/understandable reason”. We rephrased to make this clearer and removed both “appropriate” and “traceable”.
page 37, last sentence of the conclusions: I did not follow the argument why high-level variables are better than low-level variables for the retraining of a neural network. Please consider rephrasing for more clarity.
— This remark reflects a practical aspect. We are aware that in ML unconditioned features are preferred, and potentially perform even better than what we can show with our choices. Yet, in case of changing beam conditions or other effects relevant for the calibration, a (relatively) quick training and inference is an advantage. Any “raw data” (cell-based) calibration would require a full reprocessing of the data and MC campaigns, which usually can only happen once or twice per run year. The model presented here can be applied as often as needed during the run year.
Typos and minor suggestions:
page 10, numbered list, item 3: “it not all” -> “if not all”
— Thank you, fixed!
page 19, last paragraph: I suggest to rephrase that the median “is better defined than the statistical mean”. The choice of median/mode/arithmetic mean/… is only a choice. You argue why use the median here, which is fine, but I would refrain from a statement saying that one or another choice is “better defined”. Please consider rephrasing.
— We changed the text and removed “better defined”.
page 29: “the use of a Gaussian mixture model”
— Thanks, fixed.
page 34, first line: I suggest to rephrase “confirms that” to “is consistent with the observation that”.
— We rephrased along these lines.
— Thank you very much for your helpful review. We hope you find our responses satisfactory.
Recommendation
Ask for minor revision
validity: top | significance: high | originality: top | clarity: high | formatting: perfect | grammar: perfect