26–30 Aug 2024
Europe/Berlin timezone

Data Reduction Strategy at the European XFEL

30 Aug 2024, 15:40
15m
Saal A

Saal A

Contributed talk 3. Data, Automation and the Use of AI Mikrosymposium 3/4: Data, Automation and the Use of AI

Speaker

Egor Sobolev (European XFEL)

Description

The European XFEL is a megahertz repetition-rate facility producing extremely bright and coherent pulses of duration of the order of few femtoseconds or less. Owing to its X-ray imagers, specifically built to operate at these repetition rates (AGIPD, DSSC and LPD), the amount of data generated in the context of user experiments can exceed hundreds of gigabits per second, resulting in tens of petabytes stored every year. These rates and volumes pose significant challenges both for the facility and its users. In fact, if unaddressed, extraction and interpretation of scientific content is hindered, and investments and operational costs quickly becomes unsustainable.

We are working, in close collaboration with our users, to address the above-mentioned topics at different levels. On the administrative level, we largely revisited our scientific data and retention policies so as to establish a framework for data reduction, and includes provision of open and FAIR data. Then, we are also introducing comprehensive data management plans, to streamline and enhance facility-users communication, including requirements and agreements on services and methods. On a technical and scientific level, we are upgrading our data systems to incorporate data reduction tools and methods [1,2], which are either developed inside or outside the facility. Finally, we are developing extensive metrics to assess reduction quality, and to corroborate automation of reduction processes.

In this talk I will highlight challenges and solutions implemented to date, and detail our vision for user-centric data reduction [2].

[1] Schmidt P, et.al. (2024) Turning European XFEL raw data into user data. Front. Phys. 11:1321524. doi: 10.3389/fphy.2023.1321524
[2] Sobolev E, et.al. (2024) Data reduction activities at European XFEL: early results. Front. Phys. 12:1331329. doi: 10.3389/fphy.2024.1331329

I plan to submit also conference proceedings No

Primary authors

Egor Sobolev (European XFEL) Philipp Schmidt (European XFEL) Janusz Malka (European XFEL) David Hammer (European XFEL) Djelloul Boukhelef (European XFEL) Johannes Moeller (European XFEL) Karim Ahmed (European XFEL) Richard Bean (European XFEL) Ivette Jazmin Bermudez Macias (European XFEL) Johan Bielecki (European XFEL) Ulrike Boesenberg (European XFEL) Cammille Carinan (European XFEL) Fabio Dall'Antonia (European XFEL) Sergey Esenov (European XFEL) Hans Fangohr (European XFEL) Danilo Enoque Ferreira de Lima (European XFEL) Luis Maia (European XFEL) Hadi Firoozi (European XFEL) Gero Flucke (European XFEL) Patrick Gessler (European XFEL) Gabriele Giovanetti (European XFEL) Jayanath Koliyadu (European XFEL) Anders Madsen (European XFEL) Thomas Michelat (European XFEL) Michael Schuh (European XFEL) Marcin Sikorski (European XFEL) Alessandro Silenzi (European XFEL) Jolanta Sztuk-Dambietz (European XFEL) Monica Turcato (European XFEL) Oleksii Turkot (European XFEL) James Wrigley (European XFEL) Steve Aplin (European XFEL) Steffen Hauf (European XFEL) Krzysztof Wrona (European XFEL) Luca Gelisio (European XFEL)

Presentation materials