AccGPT is an ongoing project to integrate AI into various levels of CERN's operations, particularly in the domain of particle accelerator control. The goal is to embed AI assistants in critical areas: aiding control room operations for managing accelerators, assisting in coding for development purposes, and enhancing the effectiveness of documentation and knowledge retrieval. These...
The ATLAS Collaboration is composed of around 6,000 scientists, engineers, developers, students and administrators, with decades of institutional documentation spread across wikis, code docs, meeting agendas, recommendations, publications, tutorials, and project management systems. With the advent of retrieval augmented generation (RAG) and sophisticated large language models (LLMs) such as...
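The retrieval step at the heart of such a RAG system can be sketched as follows; this is a minimal illustration assuming precomputed document embeddings and cosine similarity, not a detail taken from this abstract or from ATLAS tooling:

```python
import numpy as np

def retrieve(query_emb, doc_embs, k=3):
    """Return indices of the k documents most similar to the query.

    Minimal sketch of RAG retrieval: documents and the query are
    embedded (embeddings assumed precomputed here), and the top-k
    matches by cosine similarity would then be passed to the LLM
    as context. Function and parameter names are illustrative.
    """
    # Normalize so that dot products are cosine similarities.
    q = query_emb / np.linalg.norm(query_emb)
    d = doc_embs / np.linalg.norm(doc_embs, axis=1, keepdims=True)
    sims = d @ q
    # Highest-similarity documents first.
    return np.argsort(sims)[::-1][:k]

rng = np.random.default_rng(1)
docs = rng.standard_normal((5, 8))
print(retrieve(docs[2], docs, k=2))
```

A production system would add an embedding model, chunking of the wikis and meeting agendas, and an approximate-nearest-neighbour index instead of the brute-force scan shown here.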
We introduce a summer school workshop designed for a group of gifted students from different backgrounds.
In this workshop, AI language assistants will be employed to aid students in analysing Open Data from the ATLAS experiment at CERN, with a specific emphasis on the Higgs boson discovery.
This initiative aims to demonstrate the practical application of AI tools like ChatGPT...
Large language models have, as the name implies, large numbers of parameters. As such, not only the training costs but also the inference costs of these models are substantial. One strategy for reducing inference costs is to quantize the model weights from 16-bit floating-point values to a format with 2-8 bits per weight. However, these custom data formats in turn require custom...
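The weight-quantization idea can be sketched with a simple round-to-nearest scheme; the 4-bit width and per-group scaling below are illustrative choices for the 2-8 bit range mentioned above, not the specific format this work proposes:

```python
import numpy as np

def quantize_4bit(weights, group_size=64):
    """Quantize float weights to 4-bit integers with one scale per group.

    Minimal round-to-nearest sketch; practical schemes (e.g. GPTQ, AWQ)
    are more sophisticated, and group_size=64 is an illustrative choice.
    """
    w = weights.reshape(-1, group_size)
    # One scale per group maps the largest magnitude onto the int4 range [-8, 7].
    scales = np.max(np.abs(w), axis=1, keepdims=True) / 7.0
    scales = np.maximum(scales, 1e-12)  # guard against all-zero groups
    q = np.clip(np.round(w / scales), -8, 7).astype(np.int8)
    return q, scales

def dequantize_4bit(q, scales, shape):
    """Reconstruct approximate float weights from the 4-bit codes."""
    return (q.astype(np.float32) * scales).reshape(shape)

rng = np.random.default_rng(0)
w = rng.standard_normal((128, 64)).astype(np.float32)
q, s = quantize_4bit(w)
w_hat = dequantize_4bit(q, s, w.shape)
print(float(np.abs(w - w_hat).max()))  # per-element error bounded by half a scale step
```

This also shows why custom kernels are needed: the stored `int8` codes and per-group scales must be unpacked and rescaled on the fly during inference, which standard dense-matmul kernels do not do.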
Large language models see rapid adoption in various domains, prompting us to rethink established teaching paradigms. We examine their utility in university-level physics education, focusing on two main aspects: Firstly, how reliable are publicly accessible models in answering exam-style multiple-choice questions? Secondly, how does the question's language affect the models' performance? We...
This work utilizes natural language processing (NLP) techniques to uncover trends and emerging directions in research on the strong coupling of quantum chromodynamics. We developed an NLP pipeline to extract key topics and trends from abstracts related to the strong coupling in the InspireHEP corpus. We performed topic modeling over time, which reveals clusters and trends of related ideas...
We report progress in using LLMs to generate particle theory Lagrangians. By treating Lagrangians as complex, rule-based constructs similar to linguistic expressions, we employ transformer architectures, proven in language processing tasks, to model and predict Lagrangians. A dedicated dataset, which includes the Standard Model and a variety of its extensions featuring various scalar and...
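The first step in treating a Lagrangian as a linguistic expression is to split it into symbolic tokens a transformer can consume; the plain-text notation and regex below are hypothetical illustrations, not the tokenizer or dataset from this work:

```python
import re

def tokenize_lagrangian(expr):
    """Split a Lagrangian written in plain text into symbolic tokens.

    Illustrative sketch: field symbols with indices (e.g. F_munu) stay
    single tokens, while numbers and operators become separate tokens.
    """
    pattern = r"[A-Za-z]+_?[A-Za-z0-9]*|\d+|[+\-*/^()]"
    return re.findall(pattern, expr)

print(tokenize_lagrangian("-1/4 F_munu F^munu + psibar (i gamma^mu D_mu - m) psi"))
```

Once expressed as token sequences, Lagrangians can be fed to a standard sequence-to-sequence transformer in the same way as sentences.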
Recent advances in large language models (LLMs) like ChatGPT have demonstrated their potential for generating human-like text and reasoning about topics with natural language. However, applying these advanced LLMs requires significant compute resources and expertise that are out of reach for most academic researchers. To make scientific LLMs more accessible, we have developed Helmholtz...
The surge in observational capabilities and the heightened focus on time-domain astronomy have led to a substantial increase in data volume, reshaping how astrophysicists interpret, process, and categorize information. Despite the utilization of machine-readable data formats in certain instances, a significant portion of information is conveyed through natural language reports. To address the...
In the complex realm of academic research, scholars often grapple with the daunting task of efficiently navigating extensive literature, discerning emerging trends, and evaluating the novelty and feasibility of proposed research ideas. This abstract introduces "MetaInsight," an innovative LLM (Large Language Model)-powered research assistant designed to mitigate these challenges and augment...
Navigating the landscape of particle accelerators has become increasingly challenging with recent surges in contributions. These intricate devices challenge comprehension, even within individual facilities.
To address this, we introduce PACuna, a fine-tuned language model refined through publicly available accelerator resources like conferences, pre-prints, and books.
We automated data...
I will present a multi-modal model that associates astronomical observations imaged by the Hubble Space Telescope with natural language. I will show that the model embodies a meaningful joint representation between the highly domain-specific images and text using a variety of downstream tasks. The model demonstrates the potential of using generalist rather than task-specific models in parts of...
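A common way to learn such a joint image-text representation is a CLIP-style symmetric contrastive objective; the abstract does not specify the training objective, so the loss below is an illustrative assumption:

```python
import numpy as np

def clip_style_loss(img_emb, txt_emb, temperature=0.07):
    """Symmetric contrastive (InfoNCE) loss over matched image-text pairs.

    Sketch of the CLIP-style objective often used for joint image-text
    representations; row i of img_emb is assumed to match row i of txt_emb.
    """
    # L2-normalize so dot products are cosine similarities.
    img = img_emb / np.linalg.norm(img_emb, axis=1, keepdims=True)
    txt = txt_emb / np.linalg.norm(txt_emb, axis=1, keepdims=True)
    logits = img @ txt.T / temperature   # (N, N) similarity matrix
    labels = np.arange(len(logits))      # image i matches text i

    def xent(l):
        # Numerically stable softmax cross-entropy on the diagonal targets.
        l = l - l.max(axis=1, keepdims=True)
        logp = l - np.log(np.exp(l).sum(axis=1, keepdims=True))
        return -logp[labels, labels].mean()

    # Average the image-to-text and text-to-image directions.
    return 0.5 * (xent(logits) + xent(logits.T))
```

Correctly matched pairs drive the diagonal similarities up and the loss toward zero, which is what makes the learned embedding space useful for the downstream tasks mentioned above.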