July 21, 2025

AI Testing and Evaluation: Reflections

29 minutes

In the series finale, Amanda Craig Deckard returns to examine what Microsoft has learned about testing as a governance tool. She also explores the roles of rigor, standardization, and interpretability in testing and what’s next for Microsoft’s AI governance work.

Show notes: https://www.microsoft.com/en-us/research/podcast/ai-testing-and-evaluation-reflections/

Microsoft Research Podcast

By Researchers across the Microsoft research community

4.8

8080 ratings
