DPG Phi
Verhandlungen
Verhandlungen
DPG

SMuK 2023 – wissenschaftliches Programm

Bereiche | Tage | Auswahl | Suche | Aktualisierungen | Downloads | Hilfe

AKPIK: Arbeitskreis Physik, moderne Informationstechnologie und Künstliche Intelligenz

AKPIK 2: Applications in Particle and Astroparticle Physics

AKPIK 2.3: Vortrag

Dienstag, 21. März 2023, 17:30–17:45, ZEU/0118

Fast Columnar Physics Analyses of Terabyte-Scale LHC Data on a Cache-Aware Dask ClusterSvenja Diekmann, Niclas Eich, Martin Erdmann, Peter Fackeldey, •Benjamin Fischer, Dennis Noll, and Yannik Rath — III. Physikalisches Institut A, RWTH Aachen University

The development of an LHC physics analysis involves numerous investigations that require the repeated processing of terabytes of data. Thus, a rapid completion of each of these analysis cycles is central to mastering the science project.

We present a solution to efficiently handle and accelerate physics analyses on small-size institute clusters. Our solution uses three key concepts: Vectorized processing of collision events, the ``MapReduce'' paradigm for scaling out on computing clusters, and efficiently utilized SSD caching to reduce latencies in IO operations. This work focuses on the latter key concept, its underlying mechanism, and its implementation.

Using simulations from a Higgs pair production physics analysis as an example, we achieve an improvement factor of 6.3 in the runtime for reading all input data after one cycle and even an overall speedup of a factor of 14.9 after 10 cycles, reducing the runtime from hours to minutes.

100% | Mobil-Ansicht | English Version | Kontakt/Impressum/Datenschutz
DPG-Physik > DPG-Verhandlungen > 2023 > SMuK