Bretton Goods

Ep 48: An Introduction to AI Alignment with Trevor Chow


Listen Later

I spoke to Trevor Chow about existential risks from AI and techniques to align artificial intelligence with human goals. Specifically we talked about

  • An introduction to existential risk from Artificial Intelligence
  • Existing methods for alignment of AI models
  • Why RLHF might fail in large language models
  • Whether interpretability research might scale?
  • New methods being developed to make larger models safer
  • Regulatory frameworks for the future of AI
  • ---
    Send in a voice message: https://podcasters.spotify.com/pod/show/pradyumna-sp/message
    ...more
    View all episodesView all episodes
    Download on the App Store

    Bretton GoodsBy Pradyumna Prasad

    • 5
    • 5
    • 5
    • 5
    • 5

    5

    1 ratings