26 September 2025
DESY
Europe/Berlin timezone

A multiscale analysis of mean-field transformers in the moderate interaction regime

26 Sept 2025, 10:00
45m
Building 1b, Seminar Room 4ab (DESY)

Building 1b, Seminar Room 4ab

DESY

Notkestraße 85 22607 Hamburg Germany

Speaker

Giuseppe Bruno

Description

In this talk, we study the evolution of tokens across the depth of encoder-only transformer models at inference time, modeling them as a system of interacting particles in the infinite-depth limit. Motivated by techniques for extending the context length of large language models, we focus on the moderate interaction regime, where the number of tokens is large and the inverse temperature parameter scales accordingly. In this setting, the dynamics exhibit a multiscale structure. Using PDE analysis, we identify different phases depending on the choice of parameters.

Presentation materials

There are no materials yet.