1st Pan-European Advanced School on Statistics in High Energy Physics

Europe/Berlin
SR 4a/b (DESY Hamburg)

SR 4a/b

DESY Hamburg

Description

github-com/diana-hep/madminer

We are looking forward to welcome you to the first Pan-European Advanced School on Statistics in High Energy Physics.  The program consists of lectures on advanced or not so frequently covered topics, supplemented by tutorials. The school is open to PhD students and Post-docs. Participants should already have a good knowledge of basic statistical methods in data analysis.

Covered topics:

  •  Bayesian inference
  •  Machine learning
  •  Unfolding
  •  Gaussian Processes
  •  Non-Parametric Inference

The registration fee is 50 Euro and has to be paid cash.

More information on the INSIGHTS Marie Sklodowska-Curie ITN  can be found here

Vidyo connection  Allen Caldwell whiteboard lecture monday and tuesday: showing camera image of whiteboard, no audio:

https://vidyoportal.cern.ch/join/KzCDUbNKYXFf9o5gj1GRmX37Id4

DESY map with venues
Poster
Participants
  • Adam Parker
  • Amartya Rej
  • Anastasia Karavdina
  • Andrea Cardini
  • Andrea Malara
  • Andreas Lindner
  • Anjali Krishnan
  • Arne Christoph Reimers
  • Artem Basalaev
  • Artem Golovatiuk
  • Bram Schermer
  • Christof Sauer
  • Cornelius Grunwald
  • Cristiano Alpigiani
  • Daria Morozova
  • Dennis Schwarz
  • Dirk Kruecker
  • Erik Buhmann
  • Fabian Becherer
  • Fabian Sohns
  • Fabio Priuli
  • Ferdinand Schenck
  • Grégory Baltus
  • Henriette Petersen
  • Hevjin Yarar
  • Inna Henning
  • Isabell Melzer-Pellmann
  • Jindrich Lidrych
  • Kacper Lasocha
  • Karla Pena Rodriguez
  • Kilian Lieret
  • Kristian Boroz
  • Krystsina Petukhova
  • Ksenia de Leo
  • Leonora Vesterbacka
  • Lolian Shtembari
  • Luis Ignacio Estevez Banos
  • Lukas Layer
  • Marek Niedziela
  • Mariel Pettee
  • Marta Czurylo
  • Matteo Defranchis
  • Michaela Schever
  • Michele Faucii Giannelli
  • Mykola Savitskyi
  • Nataliia Kovalchuk
  • Nathan Simpson
  • Olaf Behnke
  • Oleg Filatov
  • Olin Pinto
  • Oliver Schulz
  • Paolo Gunnellini
  • Peter Fratric
  • Pim Verschuuren
  • Rafael Eduardo Sosa Ricardo
  • Rahul Balasubramanian
  • Roel Smits
  • Rohit Maitri
  • Rose Koopman
  • Runxuan Liu
  • Salvatore La Cagnina
  • Santeri Laurila
  • Sara Borroni
  • Sara Cerioli
  • Sascha Diefenbacher
  • Serena Palazzo
  • Simon Schnake
  • Simone Amoroso
  • Sitong An
  • Stefan Richter
  • Stephan Stern
  • Vasyl Hafych
  • Viktor Ananiev
  • Vitaly Magerya
  • Wouter Verkerke
  • Yu Xu
  • Ömer Penek
    • 12:30 14:00
      Registration SR 4a/b

      SR 4a/b

      DESY Hamburg

    • 14:00 18:10
      Session 1 SR 4a/b

      SR 4a/b

      DESY Hamburg

      • 14:00
        Welcome to School 5m
        Speaker: Dr Olaf Behnke (DESY)
        Slides
      • 14:05
        Bayesian Inference Lecture I 1h 25m
        1) What is probability, leading to derivation of Bayes’ theorem 2) Comparison of Bayes & Frequentist for the Poisson problem with and w/o background, comparison of Bayes with Neyman, Feldman-Cousins 3) Detailed analysis of the on/off problem 4) The role of priors, ways to define them, deciding when they are important
        Speaker: Allen Caldwell (MPI Munich)
        Class Notes
        Handout
      • 15:30
        Coffee Break 30m
      • 16:00
        Bayesian Analysis Toolkit Tutorial I 2h
        BAT session 1 will be devoted to brief intro to the Julia programming language, installing the package, and solving a simple Poisson problem
        Speaker: Dr Oliver Schulz (MPI Munich)
        Slides
    • 18:30 20:00
      Reception Foyer of the main auditorium (building 5)

      Foyer of the main auditorium (building 5)

      DESY Hamburg

      Map
    • 09:00 12:30
      Session 2 SR 4a/b

      SR 4a/b

      DESY Hamburg

      • 09:00
        Bayesian Inference Lecture II 1h 30m
        5) Model testing: the different schools 6) Detailed comparison and discussion of the Jeffreys’Lindley Paradox 7) Approaches to mapping the posterior pdf and our scheme for parallelizing MCMC
        Speaker: Allen Caldwell (MPI Munich)
      • 10:30
        Coffee Break 30m
      • 11:00
        Bayesian Analysis Toolkit Tutorial II 1h 30m
        BAT session 2 will look at comparing fitting a spectrum and determining whether or not a signal is present.
        Speaker: Dr Oliver Schulz (MPI Munich)
    • 12:30 14:00
      Lunch Break 1h 30m SR 4a/b

      SR 4a/b

      DESY Hamburg

    • 14:00 18:00
      Session 3 SR 4a/b

      SR 4a/b

      DESY Hamburg

      • 14:00
        Likelihood Free Inference 1h 30m
        General introduction + focus on likelihood-ratio based approaches
        Speaker: Gilles Louppe (Liege University)
        Slides
      • 15:30
        Coffee Break 30m
      • 16:00
        Likelihood-free Inference (Passive) Tutorial 30m
        Speaker: Gilles Louppe (University of Liège)
      • 16:30
        Probabilistic Programming 1h 30m
        Lecture and (Active) Tutorial on a high energy physics problem
        Speaker: Lukas Heinrich (CERN)
        Slides
    • 09:00 12:30
      Session 4 SR 4a/b

      SR 4a/b

      DESY Hamburg

      • 09:00
        Generative Models 1h 30m
        Lecture: - Adversarial Training - Generative Models (GAN, WGAN, Autoencoder, Variational Autoencoder) - Bonus topics: Bayesian networks, Autoencoders for anomalies
        Speaker: Gregor Kasieczka (Institut fuer Experimentalphysik / UHH)
        Slides
      • 10:30
        Coffee Break 30m
      • 11:00
        Variational Autoencoder Tutorial 1h 30m
        Training generative models on image data
        Speaker: Gregor Kasieczka (Institut fuer Experimentalphysik / UHH)
        Slides
    • 12:30 14:00
      Lunch Break 1h 30m SR 4a/b

      SR 4a/b

      DESY Hamburg

    • 14:00 18:45
      Session 5 SR 4a/b

      SR 4a/b

      DESY Hamburg

      • 14:00
        Exemplary usages of GANs in the ATLAS experiment 1h
        Fast simulation will be a crucial tool for all LHC experiments from Run 3 and especially during HL-LHC. The increase in luminosity will not be matched by a similar increase in computing resources, therefore fast simulation will be the only way forward. Fast simulation is already used by the LHC experiments but it will need to be significantly improved to be used by all analyses. A possible way forward is to exploit the generative tool developed by the ML community, in particular GANs. In this lecture I will present the current state-of-the-art in ATLAS and use it to describe the challenges faced by this approach. Additional examples will be taken from another GAN developed to generate and simulate di-jet and top quark events. While the generation of event is not currently the bottle neck in the MC production chain, it will soon require significant resources as new, more precise but also more complex MC generators will be developed. Keywords: fast calorimeter simulation, fast event generation, GAN optimisation, voxalisation, conditioning
        Speaker: Michele Faucci Giannelli (Edinburgh)
        Slides
      • 15:00
        Traditional inference with ML tools I 30m
        Speaker: Lukas Heinrich (CERN)
        Slides
      • 15:30
        Coffee Break 30m
      • 16:00
        Traditional inference with ML tools II 30m
        Speaker: Lukas Heinrich (CERN)
      • 16:30
        Automated Differentiation 45m
        Speaker: Lukas Heinrich (CERN)
        Slides
      • 17:15
        Bayesian Inference III 1h
        Speaker: Allen Caldwell (MPI)
    • 19:30 22:00
      School Dinner Canteen Extension Bld. 9

      Canteen Extension Bld. 9

      DESY Hamburg

    • 09:00 12:30
      Session 6 SR 4a/b

      SR 4a/b

      DESY Hamburg

      • 09:00
        Decision making under uncertainties" 45m
        In this seminar we present some aspects of the so-called "decision making under uncertainties"  that are typical in the context of business and strategical management, where the data represent  an important source of information that is not always exploited to the full extent of its potential.  In particular, we will describe a few cases taken from the experiences of Pangea Formazione,  where the Bayesian approach and more in general Bayesian probabilistic models, when paired with suitable data science solutions, have allowed to deal properly with uncertainties and to  provide valuable insights about corporate processes.
        Speakers: Fabio Priuli (Pangea Formazione S.r.l.), Sara Borroni (Pangea Formazione S.r.l.)
        Slides
      • 09:45
        Gaussian Processes for Particle Physicists I 45m
        * Definition and basic properties * Mean functions, covariance functions and parameter estimation * Relation to other statistical learning techniques * Applications in physics
        Speaker: Prof. Mikael Kuusela (Carnegie Mellon University)
        Slides
      • 10:30
        Coffee Break 30m
      • 11:00
        Gaussian Processes for Particle Physicists II 45m
        Speaker: Prof. Mikael Kuusela (Carnegie Mellon University)
      • 11:45
        Introduction to Unfolding Methods in HEP I 45m
        * Why is unfolding needed and why is it difficult? * Overview of common unfolding methods * Choice of the regularization strength * Uncertainty quantification
        Speaker: Prof. Mikael Kuusela (Carnegie Mellon University)
        Slides
    • 12:30 13:30
      Lunch Break 1h SR 4a/b

      SR 4a/b

      DESY Hamburg

    • 13:15 19:00
      Session 7 SR 4a/b

      SR 4a/b

      DESY Hamburg

      • 13:30
        Introduction to Unfolding Methods in HEP II 45m
        * Why is unfolding needed and why is it difficult? * Overview of common unfolding methods * Choice of the regularization strength * Uncertainty quantification
        Speaker: Prof. Mikael Kuusela (Carnegie Mellon University)
      • 14:15
        RooFitUnfold Tutorial 1h 30m
        Introduction to Unfolding inside RooFit. Includes - preparing inputs - choosing unfolding method, - systematics handling and bias calculation. In this tutorial you learn to set up your analysis using unfolding from start to finish using realistic physics examples.
        Speakers: Carsten Burgard (Nikhef), Dr Lydia Brenner (DESY), Mr Pim Verschuuren (Royal Holloway, University of London)
        Slides
      • 15:45
        Coffee Break 30m
      • 16:15
        RooFitUnfold Tutorial II 1h 15m
        Speakers: Carsten Burgard (Nikhef), Dr Lydia Brenner (DESY), Pim Verschuren (Royal Holloway, University of London)
      • 17:30
        Approximate Bayesian Computing 1h 30m
        This lecture will provide an overview of approximate Bayesian computation (ABC), starting with a focus on the motivation for the procedure, including when it makes sense to utilize it. The resulting approximation can be thought of as calculating the posterior under a contaminated data set; this interpretation provides a useful context for the procedure. The main challenge to using the approach is the computational difficulties, so the lecture will cover algorithms that ease this burden, including consideration of sequential Monte Carlo approaches, and the importance of careful choice of summary statistics.
        Speaker: Chad Shafer (CMU)
        Slides
    • 09:00 12:30
      Session 8 SR 4a/b

      SR 4a/b

      DESY Hamburg

      • 09:00
        Non-Parametric Inference I 1h 30m
        These two lectures will provide an overview of the utility and challenges of nonparametric approaches to statistical inference. Nonparametric inference has great promise to "let data speak for themselves" and enable a researcher to avoid restrictive model assumptions. This can be especially important when massive sets of data need to be summarized in a way that preserved scientific information while not missing important features. Topics covered will include nonparametric density estimation and nonparametric regression. Focus will be placed on the curse of dimensionality, and the crucial role it plays in motivating particular nonparametric approaches. Underlying statistical theory will be presented.
        Speaker: Chad Shafer (CMU)
        Jupyternotebook
        Slides Material II
        Slides part I
      • 10:30
        Coffee Break 30m
      • 11:00
        Non-Parametric Inference II 1h 25m
        Speaker: Chad Shafer (CMU)
      • 12:25
        Closing of the School 5m
        Speaker: Dr Olaf Behnke (DESY)
        Slides
    • 12:30 14:00
      Lunch Break 1h 30m SR 4a/b

      SR 4a/b

      DESY Hamburg