<a href='https://stephencasper.com/'>Stephen Casper</a>, a computer science PhD student at MIT, joined the podcast to discuss AI interpretability, red-teaming and robustness, evaluations and audits, reinforcement learning from human feedback, Goodhart’s law, and more.
Our music is by <a href='https://www.linkedin.com/in/micah-rubin-0575421b2/'>Micah Rubin</a> (Producer) and <a href='https://www.instagram.com/twinbrothermusic/'>John Lisi</a> (Composer).
For a transcript and relevant links, visit the <a href='https://aipolicypod.substack.com/'>Center for AI Policy Podcast Substack</a>.

Stephen Casper, a computer science PhD student at MIT, joined the podcast to discuss AI interpretability, red-teaming and robustness, evaluations and audits, reinforcement learning from human feedback, Goodhart’s law, and more. Our music is by Micah Rubin (Producer) and John Lisi (Composer). For a transcript and relevant links, visit the Center for AI Policy Podcast Substack.

<a href="https://stephencasper.com/" rel="noopener noreferrer">Stephen Casper</a>, a computer science PhD student at MIT, joined the podcast to discuss AI interpretability, red-teaming and robustness, evaluations and audits, reinforcement learning from human feedback, Goodhart’s law, and more.
Our music is by <a href="https://www.linkedin.com/in/micah-rubin-0575421b2/" rel="noopener noreferrer">Micah Rubin</a> (Producer) and <a href="https://www.instagram.com/twinbrothermusic/" rel="noopener noreferrer">John Lisi</a> (Composer).
For a transcript and relevant links, visit the <a href="https://aipolicypod.substack.com/" rel="noopener noreferrer">Center for AI Policy Podcast Substack</a>.

#10: Stephen Casper on Technical and Sociotechnical AI Safety Research

The Center for AI Policy Podcast zooms into the strategic landscape of AI and unpacks its implications for US policy.

This podcast is a publication from the Center for AI Policy (CAIP), a nonpartisan research organization dedicated to mitigating the catastrophic risks of AI through policy development and advocacy. Based in Washington, DC, CAIP works to ensure AI is developed and implemented with effective safety standards.

Technology

Government

The Center for AI Policy Podcast zooms into the strategic landscape of AI and unpacks its implications for US policy. This podcast is a publication from the Center for AI Policy (CAIP), a nonpartisan research organization dedicated to mitigating the catastrophic risks of AI through policy development and advocacy. Based in Washington, DC, CAIP works to ensure AI is developed and implemented with effective safety standards.

Share #10: Stephen Casper on Technical and Sociotechnical AI Safety Research

Sign up to save your podcasts

#10: Stephen Casper on Technical and Sociotechnical AI Safety Research

#10: Stephen Casper on Technical and Sociotechnical AI Safety Research