26–30 Aug 2024
Europe/Berlin timezone

Real-Time Data Processing for Serial X-ray Crystallography

30 Aug 2024, 14:35
20m
Saal A

Saal A

Invited talk 3. Data, Automation and the Use of AI Mikrosymposium 3/4: Data, Automation and the Use of AI

Speaker

Thomas White (FS-SC)

Description

We have implemented a system for fully real-time processing of data from serial X-ray protein crystallography experiments. The system handles the full data rate from a 16 megapixel Dectris EIGER2 X detector at 133 frames per second, performing the standard serial crystallography data processing pipeline consisting of peak search, diffraction pattern indexing, spot “prediction” and integration. The processing time for each frame, on a single CPU, is much less than one second. Therefore, a small number of CPU cores (around 40) are sufficient to keep up with the data. This was achieved while handling the entire resolution of the detector: even higher speed is possible by binning the pixel values to reduce the effective number of pixels.

The system has been deployed at the P11 beamline of PETRA III, where it has been reliably used in a series of user experiments. It is based on the CrystFEL software for serial crystallography [1], in combination with the ASAP::O high-performance data framework developed at DESY [2]. Similar systems based on the same building blocks are now being tested at other experimental stations.

Real-time data processing offers many advantages. First, there is the obvious improvement in “situational awareness” during the experiment: the ability to spot problems, make improvements and know when enough data has been collected. In addition, since there is no technical need to store the detector readout data on disk, there is potential for drastic reductions in the high data storage costs which are currently associated with serial crystallography experiments. But are we ready to make the required changes to our established workflows?

This contribution will describe our real-time pipeline in detail, and discuss the implications for the way we perform experiments at large-scale facilities.

[1] https://desy.de/~twhite/crystfel/

[2] https://asapo.pages.desy.de/asapo/

I plan to submit also conference proceedings No

Primary author

Thomas White (FS-SC)

Co-authors

Tim Schoof (DESY) Sergey Yakubov (IT (IT Scientific Computing)) Aleksandra Tolstikova (FS-SC Photon Science Scientific Computing) Philipp Middendorf (FS-CFEL-1 (Forschung mit Photonen Experimente 1)) Mikhail Karnevskiy (IT (IT Scientific Computing)) Dr Valerio Mariani (Linac Coherent Light Source, SLAC National Accelerator Laboratory, Menlo Park, USA) Alessandra Henkel (FS-CFEL-1 (Forschung mit Photonen Experimente 1)) Mr Bjarne Klopprogge (FS-CFEL-1 (CFEL-Coherent X-Ray Imaging)) Juergen Hannappel (IT (IT Scientific Computing)) Dominik Oberthuer (FS-CFEL-1 (Forschung mit Photonen Experimente 1)) Ivan De Gennaro Aquino (FS-CFEL-1 (Forschung mit Photonen Experimente 1)) Dmitry Egorov (FS-CFEL-1-SC (FS-CFEL-1 Fachgruppe SC)) Anna Carina Munke (DESY/Uppsala University) Dr Janina Sprenger (FS-CFEL-1 (Forschung mit Photonen Experimente 1)) Guillaume Pompidor (DESY) Helena Taberman (FS-PETRA-D (FS-PET-D Fachgruppe P11)) Dr Andrey Gruzinov (DESY) Jan Meyer (FS-CFEL-1-BMX (FS-CFEL-1 Fachgruppe BMX)) Johanna Hakanpaeae (FS-PETRA-D (FS-PET-D Fachgruppe P11)) Martin Gasthuber (IT (IT Scientific Computing))

Presentation materials

There are no materials yet.