Preserve-to-reuse: Building REANA reproducible data analysis platform

17 Sept 2024, 09:30
45m
TU Dortmund

TU Dortmund

Restaurant Calla Vogelpothsweg 85 44227 Dortmund

Speaker

Dr Tibor Simko (CERN)

Description

In this talk we present the story of building REANA, the free and open
source platform for reusable and reproducible computational data
analyses. REANA allows researchers to structure information about
input data, analysis code, containerised environments and
computational workflows so that the analyses can be instantiated and
executed on remote compute clouds. REANA supports several declarative
workflow systems (CWL, Snakemake, Yadage), container technologies
(Apptainer, Docker) and compute backends (HTCondor, Kubernetes,
Slurm). We describe the REANA platform, the typical use cases from
experimental particle physics, exchange observations on sociological
and technological challenges inherent in facilitating reusable
computational research, as well as discuss the future plans for the
evolution of the platform.

Presentation materials