9–18 Oct 2023
DESY
Europe/Berlin timezone

Columnar analysis

9 Oct 2023, 13:30
2h
room 010 (bldg. 67)

room 010

bldg. 67

Speaker

Laurids Jeppe (CMS (CMS Fachgruppe TOP))

Description

This exercise introduces the basics of columnar data analysis in the Python Scientific Ecosystem using the nanoAOD data format used by CMS. Columnar analysis here means processing high energy physics data in a vectorized way, giving potentially large speedups compared to traditional loop-based computing. The students will use the Python packages awkward, uproot and coffea to read and explore a nanoAOD event file, understand its format as well as its documentation, compute high-level variables, select events with simple cuts, as well as fill and plot histograms of the events.

Presentation materials