Mihai Croicu: Forecasting battles: New machine learning methods for predicting armed conflict

  • Date: 21 March 2025, 13:15
  • Location: Brusewitzsalen, Gamla Torget 6, Uppsala
  • Type: Thesis defence
  • Thesis author: Mihai Croicu
  • External reviewer: Thomas Chadefaux
  • Supervisors: Håvard Hegre, Joakim Kreutz, Nils B. Weidmann
  • Research subject: Peace and Conflict Research
  • DiVA

Abstract

Over the past decade, the field of conflict forecasting has undergone a remarkable metamorphosis, transforming from a series of isolated efforts with low predictive power into large, globe-spanning projects with impressive performance. Despite this evolution, many challenges remain. First, while we are good at predicting absolute risks, we are poor at predicting conflict dynamics (onsets, escalations, de-escalations and terminations). Second, we are over-reliant on spatio-temporal features and mechanistic models due to the nature of the event data we use, thus excluding actor agency. Third, we handle neither data uncertainty nor model uncertainty. Fourth, we are lagging behind the state of the art in machine learning. This dissertation attempts to resolve some of these salient difficulties by contributing to six core elements of current-generation forecasting systems. First, time, by looking at the substantive effects and uncertainties of the temporal distance between data and forecast horizons. Second, space, by looking at the inherent uncertainties of high-resolution geospatial data and proposing a statistical method to address them. Third, feature space, by tackling the extreme feature sparsity in event data and proposing a novel deep active learning approach to mine features from existing large conflict-related text corpora. Fourth, substantive knowledge, by combining findings from the previous papers to take a fresh look at the microdynamics of conflict escalation. Fifth, the forecasting process itself, by building models that forecast directly from text, eliminating the intermediate step of manual data curation. Finally, the frontier of event data, by looking at whether the news-media-heavy way we collect fatal violent events can be extended to the collection of non-violent events.
Methodologically, the dissertation introduces state-of-the-art methods to the field, including large language models, Gaussian processes, active learning and deep time series modelling. The six papers in the dissertation show significant performance improvements, especially in forecasting dynamics.
