Thomas Bush

Oxford

Hi! I’m Tom!

I am currently a DPhil student in the AIMS CDT at the University of Oxford. Until recently, I was a MATS scholar supervised by Adria Garriga-Alonso and a research assistant at the Krueger AI Safety Lab, supervised by Prof David Krueger and working with Usman Anwar and Stephen Chung. Before that, I was an MPhil student in Machine Learning and Machine Intelligence at the University of Cambridge and a BSc student in Philosophy, Politics and Economics at LSE.

My goal is to ensure advanced AI lives up to its potential to benefit humanity. My current research revolves around the unifying theme of safe, beneficial and reliable AI. A few of the many questions I am currently interested in are as follows:

  1. How can we teach models to reason in a safe and reliable fashion? How can we verify whether models reason in such a way?
  2. How can we increase societal resilience to security threats posed by autonomous intelligent systems?
  3. How can we teach models to pursue the ends we want them to pursue?
  4. How can advances in AI be deployed to benefit humanity at scale?

selected publications

  1. Interpreting Emergent Planning in Model-Free Reinforcement Learning
    Thomas Bush, Stephen Chung, Usman Anwar, and 2 more authors
    In The Thirteenth International Conference on Learning Representations (Oral), 2025