Transformer Debugger (TDB) is a research tool developed by OpenAI’s Superalignment team to investigate and interpret the behaviors of small language models. It combines automated interpretability methods with sparse autoencoders, enabling researchers to analyze how specific neurons, attention heads, and latent features contribute to a model’s outputs. TDB allows users to intervene directly in the forward pass of a model and observe how such interventions change predictions, making it possible to answer questions like why a token was selected or why an attention head focused on a certain input. It automatically identifies and explains the most influential components, highlights activation patterns, and maps relationships across circuits within the model. The tool includes both a React-based neuron viewer for exploring model components and a backend activation server for running inferences and serving data.

Features

  • Investigates behaviors of small language models with interpretability tools
  • Intervenes in the forward pass to test effects on outputs
  • Identifies and explains neuron, attention head, and latent activations
  • Provides a React-based neuron viewer for interactive exploration
  • Includes an activation server and inference hooks for GPT-2 models
  • Offers collated activation datasets for deeper analysis

Project Activity

See All Activity >

Categories

AI Models

License

MIT License

Follow Transformer Debugger

Transformer Debugger Web Site

Other Useful Business Software
MongoDB Atlas runs apps anywhere Icon
MongoDB Atlas runs apps anywhere

Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Transformer Debugger!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Python, TypeScript

Related Categories

Python AI Models, TypeScript AI Models

Registered

2025-10-03