Sustainability and Machine Learning Group

Research group

University College London

We are a research group at UCL’s Centre for Artificial Intelligence.

Our research expertise includes:

data-efficient machine learning and probabilistic modeling
autonomous decision making and recommender systems
responsible AI and AI safety

We also work on applications related to social/environmental sustainability, climate and nuclear fusion.

If you are interested in joining the team, please check out our openings.

Meet the Team

Principal Investigators

Marc Deisenroth

Google DeepMind Chair of Machine Learning and Artificial Intelligence

Machine learning, Gaussian processes, AI for sustainability, Environmental modeling

Maria Perez-Ortiz

Associate Professor

Responsible AI, AI for sustainability, Recommender systems, Hybrid intelligence, Simulation intelligence

Administrators

Research Fellows

Daniel Giles

Senior Research Fellow

Tsunami modeling, HPC, Gaussian processes

PhD Students

Vignesh Gopakumar

PhD Student

Machine learning, Nuclear fusion, Bayesian optimization, Neural operators

Jake Cunningham

PhD Student

Machine learning, Gaussian processes, Earth systems modelling

Sicelukwanda Zwane

PhD Student

Machine learning, Robotics, Transfer Learning, Reinforcement Learning

Mathieu Alain

PhD Student

Machine learning, Graph neural networks, Diffusion models, PAC-Bayes

James Rudd-Jones

PhD Student

Sustainable policies, socio-environmental AI

Affiliates

So Takao

Senior Research Fellow (11/2020 - 07/2023)

Machine learning, Climate science, Fluid mechanics, Geometric mechanics

Oscar Key

PhD Student

Probabilistic modeling, Approximate inference, Machine learning, Climate science

Alumni

Yicheng Luo

PhD (09/2020-12/2024)

Meta-learning, Probabilistic Programming, Reinforcement Learning, Deep Generative Models

So Takao

Senior Research Fellow (11/2020 - 07/2023)

Machine learning, Climate science, Fluid mechanics, Geometric mechanics

Alexander Terenin

PhD (10/2018-11/2021)

Machine learning, Bayesian theory, Geometric machine learning

Samuel Cohen

PhD (09/2019-09/2024)

Machine learning, Optimal transport, Gaussian processes

Sanket Kamthe

PhD (10/2016-03/2021)

Machine learning, Reinforcement learning, Optimal control, Copulas

Mihaela Rosca

PhD (03/2020-05/2023)

Generative models, Optimization in deep learning, Reinforcement learning

Jacob Menick

Researcher (01/2020-10/2024)

Machine learning, Generative models, Large-scale deep learning, Variational inference, Information theory, Sparsity

Hugh Salimbeni

PhD (10/2015-10/2019)

Machine learning, Deep probabilistic models, Approximate inference

Steindór Sæmundsson

PhD (11/2016-11/2021)

Machine learning, Gaussian processes, Meta learning, Structural priors, Variational inference

K. S. Sesh Kumar

Research Associate

Machine learning, Discrete optimization, Differential privacy, Submodularity

Riccardo Moriconi

PhD (10/2016-02/2021)

Machine learning, Gaussian processes, Bayesian optimization

Janith Petangoda

PhD (10/2017-07/2022)

Machine learning, Meta learning, Differential geometry, Reinforcement learning

James Wilson

PhD (10/2017-08/2022)

Machine learning, Gaussian processes, Bayesian optimization, Practical approximate inference

Simon Olofsson

PhD (06/2016-03/2020)

Machine learning, Bayesian optimization, Mechanistic models, Model discrimination

Benjamin Chamberlain

PhD (10/2014-08/2018)

Machine learning, Community detection, Representation of graphs, Hyperbolic embeddings

Recent Blog Posts

Thin and Deep Gaussian processes

Selecting an appropriate kernel for Gaussian processes (GPs) can be challenging. Deep GPs avoid manual kernel engineering by low-dimensional embeddings of the inputs that explain the output data, but lose all the interpretability of shallow GPs. Alternatively one successively parameterize the lengthscale of a kernel, improving the interpretability but ultimately giving away the notion of learning lower-dimensional embeddings. Both methods are susceptible to particular pathologies which may hinder fitting and limit their interpretability. We propose a novel synthesis of both previous approaches. Each TDGP layer is a local linear transformation generating latent embeddings while also being the lengthscales of a kernel. This model is, unlike previous models, tailored to specifically discover lower-dimensional manifolds in the input data and behaves well when increasing the number of layers.

Daniel Augusto de Souza

Feb 12, 2024

Safe Trajectory Sampling in Model-based Reinforcement Learning

Background Model-based reinforcement learning (MBRL) approaches learn a dynamics model from system interaction data and use it as a proxy of the physical system. Instead of executing actions directly on the target system, the agent queries the dynamics model, using it to generate forward trajectories of how the system will evolve given a sequence of actions.

Sicelukwanda Zwane

Sep 13, 2023

Actually Sparse Variational Gaussian Processes

Gaussian processes infamously suffer from an $\mathcal{O}(N^3)$ computational complexity and $\mathcal{O}(N^2)$ memory requirements, rendering them intractable for even medium sized datasets where $N\gtrsim 10,000$. Sparse variational Gaussian processes have been developed to alleviate some of the pains of scaling GPs to large datasets by approximating the exact GP posterior with a variational distirbution conditioned on a small set of inducing variables designed to summarise the dataset.

Jake Cunningham

May 26, 2023

Optimal Transport for Offline Imitation Learning

With the advent of large datasets, offline reinforcement learning (ORL) is a promising framework for learning good decision-making policies without interacting with the real environment. However, offline RL requires the dataset to be reward-annotated, which presents practical challenges when reward engineering is difficult or when obtaining reward annotations is labor-intensive.

Yicheng Luo

Last updated on Jun 10, 2023

Iterative State Estimation in Non-linear Dynamical Systems Using Approximate Expectation Propagation

State estimation in nonlinear systems is difficult due to the non-Gaussianity of posterior state distributions. For linear systems, an exact solution is attained by running the Kalman filter/smoother. However for nonlinear systems, one typically relies on either crude Gaussian approximations by linearising the system (e.

So Takao

Jun 27, 2022

See all

Recent News

Papers at NeurIPS Conference

Papers accepted at NeurIPS Conference

Oct 2, 2023

Best Paper Award at FAccT 2023

Marc Deisenroth

Jun 15, 2023

Paper on Safe Trajectory Sampling Accepted at CASE

May 25, 2023

Dr. Rosca

Dr. Mihaela Rosca passed her PhD viva

Marc Deisenroth

May 3, 2023

Senior Research Fellowship in Machine Learning for Weather and Climate Science

Feb 19, 2023

See all

Recent Publications

Guaranteed Prediction Sets for Functional Surrogate models

We propose a method for obtaining statistically guaranteed prediction sets for functional machine learning methods: surrogate models …

Ander Gray, Vignesh Gopakumar, Sylvain Rousseau, Sebastien Destercke

Calibrated Physics-Informed Uncertainty Quantification

Neural PDEs have emerged as inexpensive surrogate models for numerical PDE solvers. While they offer efficient approximations, they …

Vignesh Gopakumar, Ander Gray, Lorenzo Zanisi, Timothy Nunn, Daniel Giles, Matt Kusner, Stanislas Pamela, Marc P. Deisenroth

Semantic Cross-Pose Correspondence from a Single Example

This article focuses on predicting how an object can be transformed to a semantically meaningful pose relative to another object, given …

Denis Hadjivelichkov, Sicelukwanda N. T. Zwane, Marc P. Deisenroth, Lourdes Agapito, Dimitrios Kanoulas

How Can We Diagnose and Treat Bias in Large Language Models for Clinical Decision-Making?

Kenza Benkirane, Jackie Kay, Maria Perez--Ortiz

See all publications

Recent & Upcoming Talks

Maud Lemercier: Non-adversarial training of Neural SDEs with signature kernel scores

Neural SDEs are continuous-time generative models for sequential data. State-of-the-art performance for irregular time series …

Oct 19, 2023

Thomas Baldwin-McDonald: Bayesian Deep Learning with Physics-informed Gaussian Processes

Dynamical systems are ubiquitous across the natural sciences, with many physical and biological processes being driven on a fundamental …

Oct 5, 2023

Viacheslav Borovitskiy: Geometric Gaussian Processes

Gaussian processes (GPs) are often considered to be the gold standard in settings where well-calibrated predictive uncertainty is of …

Feb 8, 2023

Michel Tsamados: AI for polar remote sensing: making sense or making it up?

My background is in physics and my early work since being a postdoc and academic has been on model parameterizations of sea ice and …

Feb 3, 2023

Marc Killpack: Modelling and Optimal Control for Uncertain Robotic Systems

Over the last several years, our research group has worked to develop control and modeling methods for large-scale, deformable, …

Feb 2, 2023

See all

Featured Publications

Ander Gray, Vignesh Gopakumar, Sylvain Rousseau, Sebastien Destercke

2025-07-24 Proceedings of the Conference on Uncertainty in Artificial Intelligence (UAI)

Guaranteed Prediction Sets for Functional Surrogate models

We propose a method for obtaining statistically guaranteed prediction sets for functional machine learning methods: surrogate models which map between function spaces, motivated by the need to build reliable PDE emulators. The method constructs nested prediction sets on a low-dimensional representation (an SVD) of the surrogate model’s error, and then maps these sets to the prediction space using set-propagation techniques. This results in prediction sets for functional surrogate models with conformal prediction coverage guarantees. We use zonotopes as basis of the set construction, which allow an exact linear propagation and are closed under Cartesian products, making them well-suited to this high-dimensional problem. The method is model agnostic and can thus be applied to complex Sci-ML models, including Neural Operators, but also in simpler settings. We also introduce a technique to capture the truncation error of the SVD, preserving the guarantees of the method.

Vignesh Gopakumar, Ander Gray, Lorenzo Zanisi, Timothy Nunn, Daniel Giles, Matt Kusner, Stanislas Pamela, Marc P. Deisenroth

2025-07-13 Proceedings of the International Conference on Machine Learning (ICML)

Calibrated Physics-Informed Uncertainty Quantification

Neural PDEs have emerged as inexpensive surrogate models for numerical PDE solvers. While they offer efficient approximations, they often lack robust uncertainty quantification (UQ), limiting their practical utility. Existing UQ methods for these models typically have high computational demands and lack guarantees. We introduce a novel framework for calibrated physics-informed uncertainty quantification to address these limitations. Our approach leverages physics residual errors as a nonconformity score within a conformal prediction (CP) framework. This enables data-free, model-agnostic, and statistically guaranteed uncertainty estimates. Our framework utilises convolutional layers as finite difference stencils for gradient estimation, our framework provides inexpensive coverage bounds for the violation of conservation laws within model predictions. In our experiments, we utilise CP to obtain marginal coverage for each cell and joint coverage over the entire prediction domain of various PDEs.

Denis Hadjivelichkov, Sicelukwanda N. T. Zwane, Marc P. Deisenroth, Lourdes Agapito, Dimitrios Kanoulas

2025-05-19 Proceedings of the International Conference on Robotics and Automation (ICRA)

Semantic Cross-Pose Correspondence from a Single Example

This article focuses on predicting how an object can be transformed to a semantically meaningful pose relative to another object, given only one or few examples. Current pose correspondence methods rely on vast 3D object datasets and do not actively consider semantic information, which limits the objects to which they can be applied. We present a novel method for learning cross-object pose correspondence. The proposed method detects interacting object parts, performs one-shot part correspondence, and uses geometric and visual-semantic features. Given one example of two objects posed relative to each other, the model can learn how to transfer the demonstrated relations to unseen object instances.

Kenza Benkirane, Jackie Kay, Maria Perez--Ortiz

2025-04-29 Proceedings of the Conference of the Nations of the Americas Chapter of the ACL

How Can We Diagnose and Treat Bias in Large Language Models for Clinical Decision-Making?

Xiaoyuan Cheng, Yi He, Yiming Yang, Xiao Xue, Sibo Cheng, Daniel Giles, Xiaohang Tang, Yukun Hu

2025-02-07 International Conference on Learning Representations (ICLR)

Learning Chaos In A Linear Way

Learning long-term behaviors in chaotic dynamical systems, such as turbulent flows and climate modelling, is challenging due to their inherent instability and unpredictability. These systems exhibit positive Lyapunov exponents, which significantly hinder accurate long-term forecasting. As a result, understanding long-term statistical behavior is far more valuable than focusing on short-term accuracy. While autoregressive deep sequence models have been applied to capture long-term behavior, they often lead to exponentially increasing errors in learned dynamics. To address this, we shift the focus from simple prediction errors to preserving an invariant measure in dissipative chaotic systems. These systems have attractors, where trajectories settle, and the invariant measure is the probability distribution on attractors that remains unchanged under dynamics. Existing methods generate long trajectories of dissipative chaotic systems by aligning invariant measures, but it is not always possible to obtain invariant measures for arbitrary datasets. We propose the Poincaré Flow Neural Network (PFNN), a novel operator learning framework designed to capture behaviors of chaotic systems without any explicit knowledge of the invariant measure. PFNN employs an auto-encoder to map the chaotic system to a finite-dimensional feature space, effectively linearizing the chaotic evolution. It then learns the linear evolution operators to match the physical dynamics by addressing two critical properties in dissipative chaotic systems (1) contraction, the system’s convergence toward its attractors, and (2) measure invariance, trajectories on the attractors following a probability distribution invariant to the dynamics. Our experiments on a variety of chaotic systems, including Lorenz systems, Kuramoto-Sivashinsky equation and Navier–Stokes equation, demonstrate that PFNN has more accurate predictions and physical statistics compared to competitive baselines including the Fourier Neural Operator and the Markov Neural Operator.

See all publications

Sustainability and Machine Learning Group

Research group

Meet the Team

Principal Investigators

Google DeepMind Chair of Machine Learning and Artificial Intelligence

Associate Professor

Administrators

Administrator

Research Fellows

Senior Research Fellow

PhD Students

PhD Student

PhD Student

PhD Student

PhD Student

PhD Student

PhD Student

PhD Student

Affiliates

Senior Research Fellow (08/2021 - 07/2023)

Senior Research Fellow (11/2020 - 07/2023)

PhD Student

DeepMind Academic Research Fellow in AI @ QMUL

PhD Student @ JKU Linz and ELLIS Exchange Student

PhD Student

Alumni

PhD (10/2020-12/2024)

Senior Research Fellow (08/2021 - 07/2023)

PhD (09/2020-12/2024)

Senior Research Fellow (11/2020 - 07/2023)

PhD (10/2018-11/2021)

PhD (09/2019-09/2024)

PhD (10/2016-03/2021)

PhD (03/2020-05/2023)

Researcher (01/2020-10/2024)

PhD (10/2015-10/2019)

PhD (11/2016-11/2021)

Research Associate

PhD (10/2016-02/2021)

PhD (10/2017-07/2022)

PhD (10/2017-08/2022)

PhD (06/2016-03/2020)

PhD (10/2014-08/2018)

Recent Blog Posts

Recent News

Recent Publications

Recent & Upcoming Talks

Featured Publications