SciPost Submission Page
Many-gluon tree amplitudes on modern GPUs: A case study for novel event generators
by Enrico Bothmann, Walter Giele, Stefan Hoeche, Joshua Isaacson, Max Knobbe
This Submission thread is now published as
Submission summary
Authors (as registered SciPost users): | Enrico Bothmann · Joshua Isaacson · Max Knobbe |
Submission information | |
---|---|
Preprint Link: | https://arxiv.org/abs/2106.06507v2 (pdf) |
Code repository: | https://www.gitlab.com/ebothmann/blockgen-archive |
Date accepted: | 2022-02-28 |
Date submitted: | 2022-01-11 14:45 |
Submitted by: | Bothmann, Enrico |
Submitted to: | SciPost Physics Codebases |
Ontological classification | |
---|---|
Academic field: | Physics |
Specialties: |
|
Approach: | Computational |
Abstract
The compute efficiency of Monte-Carlo event generators for the Large Hadron Collider is expected to become a major bottleneck for simulations in the high-luminosity phase. Aiming at the development of a full-fledged generator for modern GPUs, we study the performance of various recursive strategies to compute multi-gluon tree-level amplitudes. We investigate the scaling of the algorithms on both CPU and GPU hardware. Finally, we provide practical recommendations as well as baseline implementations for the development of future simulation programs. The GPU implementations can be found at: https://www.gitlab.com/ebothmann/blockgen-archive.
Author comments upon resubmission
List of changes
- Provide additional instructions for compiling and running the code in the linked code repository
- Add minor clarifications and improve some formulations
- Improve layout and figure placement, fix typographical/punctuation issues
- Add two missing references for multi-jet measurements at UA1 and UA2
- Clarify why it is advantageous to use a real-number only algorithm for gluon-only algorithms
- Add technical details on the GPU used for the study
- Improve results on the comparison between sampling and summing with respect to reaching a pre-defined precision goal; related to that, add a comment on the use of an improved helicity sampling algorithm
- Extend the appendix to make our point on a possible hybrid CPU/GPU ansatz clearer; in particular note that we can not compare to a GPU-only ansatz, as a full-fledged phase-space generator implementation is still missing
Published as SciPost Phys. Codebases 3-r1.0 (2022) , SciPost Phys. Codebases 3 (2022)
Reports on this Submission
Report #1 by Peter Skands (Referee 1) on 2022-1-28 (Invited Report)
Report
I thank the authors for the elaborations and clarifications they made in response to my report on v1 of this paper. In my opinion all concerns have been addressed, and I am happy to recommend this paper to proceed to publication.