CoSy-seminar with Claudio Altafino
- Date
- 12 May 2026, 12:15–13:00
- Location
- Ångström Laboratory, Å2002
- Type
- Seminar
- Lecturer
- Claudio Altafino
- Organiser
- Department of Mathematics
- Contact person
- Jörgen Östensson
Francesco Ferranti holds a seminar. Title: Modeling self-attention in transformers as a multiagent dynamical system: equilibria, stability, and asymptotic behavior. Welcome!
Everyone is welcome and the first 40 people to register will be treated to a free lunch sandwich. If you do not want lunch, you are still welcome to join. Registration formula link: https://doit.medfarm.uu.se/bin/kurt3/kurt/8905685
Abstract: Transformer architectures have rapidly become the standard algorithm for many problems in deep learning (e.g. large language models or image classification). At the core of a transformer lies a so-called self-attention mechanism. In this presentation I will show how to model such self-attention mechanism as a continuous-time multiagent-like dynamical system model living on a sphere. Such dynamics is related to a multiagent version of the Oja flow, a dynamical system that computes the principal eigenvector of a matrix, corresponding for transformers to the value matrix. In the ``single-head'' time-invariant case, the equilibria of a self-attention dynamics can be classified into four classes: consensus, bipartite consensus, clustering and polygonal equilibria. For this simplified dynamics, multiple asymptotically stable equilibria from the first three classes often coexist. Interestingly, equilibria from the first two classes are always aligned with the eigenvectors of the value matrix, often but not exclusively with the principal eigenvector.