21–23 Feb 2024
DESY
Europe/Berlin timezone

Extracting Measurements from (legacy) publications

Not scheduled
20m
Auditorium (DESY)

Auditorium

DESY

Notkestr. 85 22607 Hamburg
Prompt Engineering

Speaker

Peter Steinbach (HZDR)

Description

Scientific Publishing has built the basis of knowledge exchange since the inception of the modern scientific method. Papers of last centuries contain uncountable experimental and theoretical findings. When exploring new materials or their facets, it becomes instrumental to extract these information from a myriad of papers. In this work, we present first attempts to extract viable physics information from existing publications using large language models. While the extraction of clearly defined terminology is straight forward, we showcase results with more vague information. We will compare our findings using different input formats and language models such as Mixtral 8x7B. We hope to start a conversation in the community how to bridge the paper-to-data-table gap in our community.

Primary authors

Presentation materials