Maria Bånkestad: Structured models for scientific machine learning: From graphs to kernels

  • Datum: 25 februari 2025, kl. 9.15
  • Plats: Lecture hall Sonja Lyttkens, Ångströmlaboratoriet, hus 10, Lägerhyddsvägen 1, Uppsala
  • Typ: Disputation
  • Respondent: Maria Bånkestad
  • Opponent: Jakob Macke
  • Handledare: Thomas B. Schön
  • Forskningsämne: Datavetenskap
  • DiVA

Abstract

This thesis investigates the reciprocal relationship between science and machine learning, showing how embedding scientific principles within machine learning models enhances accuracy and interpretability in complex scientific domains. Through five contributions, this work addresses challenges spanning molecular modeling, fluid dynamics, and graph-based learning, illustrating how scientific insights can guide model development and improve performance across diverse applications.

A central focus is developing models that directly incorporate physical symmetries and laws into their structure, creating novel, scientifically grounded approaches. For instance, the GeqShift model leverages E(3)-equivariant graph neural networks to predict NMR spectra with a significant accuracy boost by capturing three-dimensional molecular structures. Similarly, our SE(2)-equivariant graph neural network models rotational and translational symmetries to enhance data efficiency and performance in fluid dynamics simulations, demonstrating the strength of symmetry-aware models in complex physical domains.

This thesis advances graph-based machine learning by developing physics-informed models. For example, we introduce a subsampling model that incorporates principles from the classic Ising model of magnetism, introducing a novel approach to graph subsampling that enhances tasks like graph explanation and mesh sparsification. Building on these graph-based techniques, we also present an approach to nonnegative matrix factorization (NMF), leveraging graph structures to accelerate low-rank factorization.

In addition to symmetry-aware frameworks, we introduce the elliptical process, a flexible extension of the Gaussian process that adapts to non-Gaussian noise. This innovation allows the model to learn noise characteristics directly from data, producing robust predictions that address a broad spectrum of real-world challenges. 

These contributions underscore the dynamic exchange between scientific principles and machine learning, illustrating how physical knowledge enhances model performance and inspires new solutions. This thesis establishes a foundational framework for advancing scientific machine learning, paving the way for future breakthroughs in the field.

FÖLJ UPPSALA UNIVERSITET PÅ

Uppsala universitet på facebook
Uppsala universitet på Instagram
Uppsala universitet på Youtube
Uppsala universitet på Linkedin