Introduction to the Scikit-HEP project

Europe/Berlin
Description

Data analysis in High Energy Physics (HEP) has evolved considerably in recent years.

In particular, the role of Python has been gaining much momentum, sharing at present the show with C++ as a language of choice. Particle Physics needs to speak Python to make use of data science projects beyond HEP, e.g. for machine learning or big data frameworks such as Apache Spark.

Eduardo Rodrigues started the Scikit-HEP project in late 2016 with a few colleagues from various backgrounds and domains of expertise.

Scikit-HEP is a community-driven and community-oriented project with the aim of providing Particle Physics at large with an ecosystem for data analysis in Python. It aims to be the link between HEP and the data science domain, enabling particle physicists to do data analysis in Python.

The project has developed considerably in the past year and is now part of the official software stack of experiments such as Belle II and KM3NeT.

Eduardo will present an introduction to the project and its packages, such as uproot, boost-histogram or Particle. A small hands-on session will provide a practical introduction.

 

  • Talk "Introduction to the Scikit-HEP project"
    • in the main auditorium
    • on Tuesday from 14:00 to 15:00 [March 03]
  • Hands-on session
    • in seminar room 2 in building 2A.
    • on Wednesday morning from 9:00 to 12:00 [March 04]

If you plan to join us for the hands-on session, you can help us by registering  (no obligations - but planing will be easier for us with a ballpark number of participants).

Remote Access

For Eduardo's talk, we are trying to set up a video link if you want to join from remote. From ~15m before the session, you should be able to join by following the link

https://meet.desy.de/invited.sf?secret=DVXCAthF0ogzcDYF5oJD3g&id=250301505

(see also this manual pdf)

Slides

 

  • Tuesday 3 March
    • Introduction to Scikit-HEP Hörsaal/Main Auditorium

      Hörsaal/Main Auditorium

      Scikit-HEP

      The Scikit-HEP project is a community-driven and community-oriented project with the aim of providing Particle Physics at large with an ecosystem for data analysis in Python. The project started in Autumn 2016 and is in full swing.

      It is not just about providing core and common tools for the community. It is also about improving the interoperability between HEP tools and the scientific ecosystem in Python, and about improving on discoverability of utility packages and projects.

      For what concerns the project grand structure, it should be seen as a toolset rather than a toolkit. The project defines a set of five pillars, which are seen to embrace all major topics involved in a physicist's work. These are:

      Datasets: data in various sources, such as ROOT, Numpy/Pandas, databases, wrapped in a common interface.
      Aggregations: e.g. histograms that summarize or project a dataset.
      Modeling: data models and fitting utilities.
      Simulation: wrappers for Monte Carlo engines and other generators of simulated data.
      Visualization: interface to graphics engines, from ROOT and Matplotlib to even beyond.
      
      Conveners: Christian Voss (DESY), Dr Eduardo Rodrigues (Univeristy of Liverpool), Dr Thomas Hartmann (DESY)
      • 1
        Introduction to Scikit-HEP
        Speaker: Dr Eduardo Rodrigues (University of Liverpool)
  • Wednesday 4 March
    • Hands-On Scikit-HEP: Pratical exercises with Scikit-HEP projects Seminar Room 02 / Building 2A

      Seminar Room 02 / Building 2A

      Conveners: Christian Voss (DESY), Dr Eduardo Rodrigues (University of Liverpool), Dr Thomas Hartmann (DESY)
      • 2
        Scikit-HEP Hands-On
        Speaker: Dr Eduardo Rodrigues (University of Liverpool)
      • 10:15
        Break