May 28, 2023

Ep 48: An Introduction to AI Alignment with Trevor Chow

Listen Later

57 minutes

I spoke to Trevor Chow about existential risks from AI and techniques to align artificial intelligence with human goals. Specifically we talked about

An introduction to existential risk from Artificial Intelligence

Existing methods for alignment of AI models

Why RLHF might fail in large language models

Whether interpretability research might scale?

New methods being developed to make larger models safer

Regulatory frameworks for the future of AI

---

Send in a voice message: https://podcasters.spotify.com/pod/show/pradyumna-sp/message

...more

View all episodes

View all episodes

Download on the App Store

Download on the App Store

Get it on Google Play

Bretton Goods

By Pradyumna Prasad

5

11 ratings

May 28, 2023

Ep 48: An Introduction to AI Alignment with Trevor Chow

Listen Later

57 minutes

I spoke to Trevor Chow about existential risks from AI and techniques to align artificial intelligence with human goals. Specifically we talked about

An introduction to existential risk from Artificial Intelligence

Existing methods for alignment of AI models

Why RLHF might fail in large language models

Whether interpretability research might scale?

New methods being developed to make larger models safer

Regulatory frameworks for the future of AI

---

Send in a voice message: https://podcasters.spotify.com/pod/show/pradyumna-sp/message

...more