AI Safety Fundamentals

Introduction to Mechanistic Interpretability


Listen Later

Our introduction introduces common mech interp concepts, to prepare you for the rest of this session's resources.

Original text: https://aisafetyfundamentals.com/blog/introduction-to-mechanistic-interpretability/

Author(s): Sarah Hastings-Woodhouse

A podcast by BlueDot Impact.

Learn more on the AI Safety Fundamentals website.

...more
View all episodesView all episodes
Download on the App Store

AI Safety FundamentalsBy BlueDot Impact