SciPost Submission Page
Machine Learning-Based Energy Reconstruction for the ATLAS Tile Calorimeter at HL-LHC
by Francesco Curcio
Submission summary
| Authors (as registered SciPost users): | Francesco Curcio |
| Submission information | |
|---|---|
| Preprint Link: | scipost_202509_00065v2 (pdf) |
| Date submitted: | Dec. 9, 2025, 3:10 p.m. |
| Submitted by: | Francesco Curcio |
| Submitted to: | SciPost Physics Proceedings |
| Proceedings issue: | The 2nd European AI for Fundamental Physics Conference (EuCAIFCon2025) |
| Ontological classification | |
|---|---|
| Academic field: | Physics |
| Specialties: |
|
| Approach: | Computational |
Abstract
The High-Luminosity Large Hadron Collider (HL-LHC) will require the ATLAS Tile Calorimeter (TileCal) to achieve precise, low-latency energy reconstruction. The legacy Optimal Filtering algorithm degrades under highi pileup due to non-Gaussian noise from out-of-time-signals, motivating machine learning (ML) approaches. We study compact neural networks for Field-Programmable Gate Array (FPGA) deployment and find that one-dimensional Convolutional Neural Networks (1D-CNNs) outperform Multi-Layer Perceptrons (MLPs). These results highlight the promise of ML for real-time TileCal reconstruction at the HL-LHC.
List of changes
Abstract: "highi pileup" -> "high pileup" (typo).
done
footnote 1: remove "of the particle" since we are talking about cells in the calorimeter here.
done
section 2: "Each channel is digitised in two 12-bit gains" -> "Each channel is amplified with two gains and then digitised with an Analog to Digital Converter (ADC)" (like this it is more correct since the gains are amplification not digitisation and ADC is defined for later).
done
results sections and figure 1 and 2: replace "error" with "uncertainty".
done
results section: "a diagonal structure can be observed in the left panel, reflected as a step corresponding to" -> "a diagonal structure can be observed in the left panel, corresponding to (etpred-ettrue/ettrue) =-1 ...
done
results section: "This indicates that CNNs better capture correlations across the window, reducing large underestimations." I am not sure I understand what this means. The diagonal structure correspond to when the NN is predicting 0. So basically the MLP is just predicting 0s for some reasons and the CNN not. This sentence needs to be more clear.
This was meant to give an explanation of why this might be, in particular that the CNN "sees" the whole window, taking correlations between BCs into account when predicting the energy. Removed for clarity.
results section: it would be nice to add the ADC uncertainty for the optimal filtering algorithm to know how much the NNs improve.
added sentence that links to [7] where plots are available for OF.
figures 1,2,3,4: " Figures taken from [7]" -> "The figure is taken from [7])" or you can just put "[7]" if you need to save space.
done
results section: " compared to around 11" -> " compared to around 11 for the MLPs".
done
Conclusion: "Beyond the improved performance, the small model size ensures scalability across the TileCal channels to be reconstructed every 25 ns." This sentence is not clear. Why the small size ensures scalability and why to be reconstructed at 35ns. Do you mean allows for multiple networks to be added in the FPGA to cover several cells and to reconstruct at 40 MHz? Please clarify.
rephrased this
General: you claim several time that the NNs that you develop are suitable for FPGAs. Was this ever demonstrated and do you have a reference. If yes please add it.
Added a reference for this at the end of the result section
