Foresight Institute Radio

Jan Leike | Superintelligent Alignment


Listen Later

Jan Leike is a Research Scientist at Google DeepMind and a leading voice in AI Alignment, with affiliations at the Future of Humanity Institute and the Machine Intelligence Research Institute. At OpenAI, he co-leads the Superalignment Team, contributing to AI advancements such as InstructGPT and ChatGPT. Holding a PhD from the Australian National University, Jan's work focuses on ensuring AI Alignment.


Key Highlights

  • The launch of OpenAI's Superalignment team, targeting the alignment of superintelligence in four years.
  • The aim to automate of alignment research, currently leveraging 20% of OpenAI's computational power.
  • How traditional reinforcement learning from human feedback may fall short in scaling language model alignment.
  • Why there is a focus on scalable oversight, generalization, automation interpretability, and adversarial testing to ensure alignment reliability.
  • Experimentation with intentionally misaligned models to evaluate alignment strategies.


Dive deeper into the session: Full Summary


About Foresight Institute

Foresight Institute is a research organization and non-profit that supports the beneficial development of high-impact technologies. Since our founding in 1987 on a vision of guiding powerful technologies, we have continued to evolve into a many-armed organization that focuses on several fields of science and technology that are too ambitious for legacy institutions to support.


Allison Duettmann

The President and CEO of Foresight Institute, Allison Duettmann directs the Intelligent Cooperation, Molecular Machines, Biotech & Health Extension, Neurotech, and Space Programs, alongside Fellowships, Prizes, and Tech Trees. She has also been pivotal in co-initiating the Longevity Prize, pioneering initiatives like Existentialhope.com, and contributing to notable works like "Superintelligence: Coordination & Strategy" and "Gaming the Future".


Get Involved with Foresight:

  • Apply: Virtual Salons & in-person Workshops
  • Donate: Support Our Work – If you enjoy what we do, please consider this, as we are entirely funded by your donations!


Follow Us: Twitter | Facebook | LinkedIn


Note: Explore every word spoken on this podcast through Fathom.fm, an innovative podcast search engine.

Hosted on Acast. See acast.com/privacy for more information.

...more
View all episodesView all episodes
Download on the App Store

Foresight Institute RadioBy Foresight Institute

  • 5
  • 5
  • 5
  • 5
  • 5

5

4 ratings


More shows like Foresight Institute Radio

View all
The Bill Simmons Podcast by The Ringer

The Bill Simmons Podcast

30,189 Listeners

StarTalk Radio by Neil deGrasse Tyson

StarTalk Radio

14,329 Listeners

Making Sense with Sam Harris by Sam Harris

Making Sense with Sam Harris

26,316 Listeners

Future of Life Institute Podcast by Future of Life Institute

Future of Life Institute Podcast

112 Listeners

Where Should We Begin? with Esther Perel by Esther Perel Global Media

Where Should We Begin? with Esther Perel

14,883 Listeners

Modern Wisdom by Chris Williamson

Modern Wisdom

4,019 Listeners

Practical AI by Practical AI LLC

Practical AI

203 Listeners

Interesting Times with Ross Douthat by New York Times Opinion

Interesting Times with Ross Douthat

7,227 Listeners

Your Undivided Attention by The Center for Humane Technology, Tristan Harris, Daniel Barcay and Aza Raskin

Your Undivided Attention

1,607 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

527 Listeners

Hard Fork by The New York Times

Hard Fork

5,527 Listeners

The Ezra Klein Show by New York Times Opinion

The Ezra Klein Show

16,022 Listeners

Robinson's Podcast by Robinson Erhardt

Robinson's Podcast

265 Listeners

The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief: Artificial Intelligence News and Analysis

629 Listeners

The Last Invention by Longview

The Last Invention

1,099 Listeners