The Gradient: Perspectives on AI

Catherine Olsson and Nelson Elhage: Anthropic, Understanding Transformers



In episode 40 of The Gradient Podcast, Andrey Kurenkov speaks to Catherine Olsson and Nelson Elhage.

Catherine and Nelson are both members of technical staff at Anthropic, an AI safety and research company working to build reliable, interpretable, and steerable AI systems. Their focus is interpretability, and this interview covers several of their recent works.

Follow The Gradient on Twitter

Outline:

(00:00) Intro
(01:10) Catherine’s Path into AI
(03:25) Nelson’s Path into AI
(05:23) Overview of Anthropic
(08:21) Mechanistic Interpretability
(15:15) Transformer Circuits 
(21:30) Toy Transformer
(27:25) Induction Heads
(31:00) In-Context Learning
(35:10) Evidence for Induction Heads Enabling In-Context Learning
(39:30) What’s Next
(43:10) Replicating Results
(46:00) Outro

Links:

Anthropic

Zoom In: An Introduction to Circuits

Mechanistic Interpretability, Variables, and the Importance of Interpretable Bases

A Mathematical Framework for Transformer Circuits

In-context Learning and Induction Heads 

PySvelte



Get full access to The Gradient at thegradientpub.substack.com/subscribe

The Gradient: Perspectives on AI, by Daniel Bashir


