Georg Lange

Georg Lange Georg Lange

Independent Researcher

Biography

I’m an independent researcher working on Mechanistic Interpretability for LLMs. I was a MATS scholar and worked with Alex Makelov and Neel Nanda on Sparse Autoencoders and Distributed Alignment Search for feature detection and subspace activation patching. Previously, I was a MSc AI student at the University of Amsterdam, where I worked on brain-like interpretable spatiotemporal Computer Vision models, supervised by Prof Iris Groen and Amber Brands.

Further, I was a graduate student at the Graduate Center, CUNY, where I worked on Reinforcement Learning, Decision Making, and Reward Sensitization and conducted fiber photometry experiments in the Nucleus Accumbens of mice, supervised by Prof Jeff Beeler.

Interests

Artificial Intelligence
Mechanistic Interpretability
Systems Neuroscience

Education

M.Sc. Artificial Intelligence
University of Amsterdam
M.Sc. Cognitive Neuroscience
Graduate Center, City University of New York
B.Sc. IT-Systems Engineering
Hasso-Plattner-Institut, Potsdam

Featured Publications

Towards principled evaluations of sparse autoencoders for interpretability and control

Mechanistic Interpretability

Towards principled evaluations of sparse autoencoders for interpretability and control

We propose a framework for evaluating sparse dictionary learning methods in mechanistic interpretability by comparing them against supervised feature dictionaries. Using the indirect object identification task as a case study, we show that while sparse autoencoders can capture interpretable features, they face challenges like feature occlusion and over-splitting that limit their effectiveness for model control compared to supervised approaches.

Apr 9, 2025

Accumbal Dopamine and Acetylcholine Dynamics during Psychostimulant Sensitization

Systems Neuroscience

Accumbal Dopamine and Acetylcholine Dynamics during Psychostimulant Sensitization

We examine how cholinergic interneuron (CIN)-specific deletion of D2 receptors affects dopamine (DA) and acetylcholine (ACh) dynamics in response to repeated psychostimulant exposure. Using dual-color photometry, we find that mice lacking D2 receptors on CINs fail to sensitize DA and ACh signaling but still develop behavioral sensitization, albeit more slowly. This suggests behavioral sensitization occurs independently from neuromodulatory sensitization driven by CIN-expressed D2 receptors.

Apr 3, 2025

Temporal adaptation aids object recognition in deep convolutional neural networks in suboptimal viewing scenario's

Computational Neuroscience

Temporal adaptation aids object recognition in deep convolutional neural networks in suboptimal viewing scenario's

We compare how intrinsic and recurrent temporal adaptation mechanisms in deep neural networks affect object recognition under challenging conditions. We find intrinsic adaptation is superior for recognizing simple, high-contrast objects in noise, whereas recurrent adaptation better maintains coherence under dynamic occlusion and improves novelty detection. These results indicate that robust object recognition likely depends on multiple parallel adaptation strategies.

Apr 3, 2025

Is This the Subspace You Are Looking For? An Interpretability Illusion for Subspace Activation Patching

Mechanistic Interpretability

Is This the Subspace You Are Looking For? An Interpretability Illusion for Subspace Activation Patching

We show that subspace interventions in mechanistic interpretability can be misleading - even when they successfully modify model behavior, they may do so by activating alternative pathways rather than manipulating the intended feature. We demonstrate this phenomenon in mathematical examples and real-world tasks, while also showing what successful interpretable interventions look like when guided by prior circuit analysis.

May 9, 2024

Recent Publications

Georg Lange, Alex Makelov*, Neel Nanda (2025). Towards principled evaluations of sparse autoencoders for interpretability and control. In ICLR 2025.

PDF Cite Twitter Arxiv

Georg Lange, Federico Gnazzo, Jeff Beeler (2025). Accumbal Dopamine and Acetylcholine Dynamics during Psychostimulant Sensitization. BioRxiv Preprint.

Amber Brands, Georg Lange, Iris Groen (2025). Temporal adaptation aids object recognition in deep convolutional neural networks in suboptimal viewing scenario's. BioRxiv Preprint.

Georg Lange, Alex Makelov*, Neel Nanda (2024). Is This the Subspace You Are Looking For? An Interpretability Illusion for Subspace Activation Patching. In ICLR 2024.

PDF Cite Code Twitter Published Version

Projects

Fibermagic

Fiber Photometry

Fibermagic

Fibermagic is a Python library for large-scale analysis of fiber photometry data.

Oct 26, 2022

Online Course on Deep Learning for Computer Vision

Online Course on Deep Learning for Computer Vision

This online course provides a comprehensive introduction to neural networks and deep learning, with a focus on computer vision applications. Starting from the theoretical foundations of artificial intelligence, the course covers both practical implementations and advanced concepts. Students learn how neural networks function, how to develop and deploy them, and explore training algorithms through hands-on exercises. The course also teaches optimization techniques and strategies for training with limited data. By the end, participants will understand how to build and optimize neural networks for their own applications.

Mar 20, 2020