| CARVIEW |
Select Language
HTTP/2 200
server: GitHub.com
content-type: text/html; charset=utf-8
last-modified: Wed, 05 Mar 2025 18:44:12 GMT
access-control-allow-origin: *
strict-transport-security: max-age=31556952
etag: W/"67c89b7c-9aba"
expires: Tue, 30 Dec 2025 13:51:57 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: C556:123DE:A1FB00:B5E6A8:6953D6A2
accept-ranges: bytes
age: 0
date: Tue, 30 Dec 2025 13:41:57 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210040-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1767102117.992036,VS0,VE207
vary: Accept-Encoding
x-fastly-request-id: c8ce6817cc79e68f2938bef46f4d378925b2d25c
content-length: 8370
Leon Lang
Biography
I am a PhD student at the University of Amsterdam, working on AI safety and previously abstract information theory. In the past, I also worked on equivariant deep learning. You can find my alignment-related blogposts on Lesswrong.
I am searching for research positions in AI Safety, so if you like my profile, please reach out to me via email.
Download my CV.
Interests
- AI Alignment and Safety
- Abstract Information Theory
- Equivariant Deep Learning
Education
-
PhD in AI Safety and Information Theory, 2/2025
University of Amsterdam
-
MSc in Artificial Intelligence, 8/2020
University of Amsterdam
-
MSc in Mathematics, 10/2017
University of Bonn
-
BSc in Mathematics, 8/2015
University of Heidelberg
Selected Publications
We develop a new foundation for a theory of causality, based on factored space models
Scott Garrabrant,
Matthias Georg Mayer,
Magdalena Wache,
Leon Lang,
Sam Eisenstat,
Holger Dell
We theoretically analyze to what extent an error in a learned reward function translates into regret of resulting policies
Lukas Fluri,
Leon Lang,
Allesandro Abate,
Patrick Forré,
David Krueger,
Joar Skalse
We theoretically and empirically study safety issues of using RLHF with human evaluators that have limited information
Leon Lang,
Davis Foote,
Stuart Russell,
Anca Dragan,
Erik Jenner,
Scott Emmons
We use the recently generalized Hu Theorem to develop a theory of purely abstract Markov random fields.
Leon Lang,
Clélia de Mulatier,
Rick Quax,
Patrick Forré
We generalize information diagrams to functions beyond Shannon entropy, including Kolmogorov complexity and the generalization error from machine learning.
Leon Lang,
Pierre Baudot,
Rick Quax,
Patrick Forré