London Futurists

Aligning AI, before it's too late, with Rebecca Gorman



Our guest in this episode is Rebecca Gorman, the co-founder and CEO of Aligned AI, a start-up in Oxford which describes itself rather nicely as working to get AI to do more of the things it should do and fewer of the things it shouldn’t.

Rebecca built her first AI system 20 years ago and has been calling for responsible AI development since 2010. With her co-founder Stuart Armstrong, she has co-developed several advanced methods for AI alignment, and she has advised the EU, UN, OECD and the UK Parliament on the governance and regulation of AI.

The conversation highlights the tools faAIr, EquitAI, and ACE, developed by Aligned AI. It also covers the significance of recent performance by Aligned AI software in the CoinRun test environment, which demonstrates the important principle of "overcoming goal misgeneralisation". 

Selected follow-ups:

  • buildaligned.ai
  • Article: "Using faAIr to measure gender bias in LLMs"
  • Article: "EquitAI: A gender bias mitigation tool for generative AI"
  • Article: "ACE for goal generalisation"
  • "CoinRun: Solving Goal Misgeneralisation" - a publication on arXiv
  • Aligned AI repositories on GitHub
  • "Specification gaming examples in AI" - article by Victoria Krakovna
  • Rebecca Gorman speaking at the Cambridge Union on "This House Believes Artificial Intelligence Is An Existential Threat" (YouTube)

Music: Spike Protein, by Koi Discovery, available under CC0 1.0 Public Domain Declaration







