The Inside View

3. Evan Hubinger on Takeoff speeds, Risks from learned optimization & Interpretability


Listen Later

We talk about Evan’s background @ MIRI & OpenAI, Coconut, homogeneity in AI takeoff, reproducing SoTA & openness in multipolar scenarios, quantilizers & operationalizing strategy stealing, Risks from learned optimization & evolution, learned optimization in Machine Learning, clarifying Inner AI Alignment terminology, transparency & interpretability, 11 proposals for safe advanced AI, underappreciated problems in AI Alignment & surprising advances in AI.

...more
View all episodesView all episodes
Download on the App Store

The Inside ViewBy Michaël Trazzi

  • 2
  • 2
  • 2
  • 2
  • 2

2

1 ratings