Microsoft Research Podcast

AI Testing and Evaluation: Reflections


Listen Later

In the series finale, Amanda Craig Deckard returns to examine what Microsoft has learned about testing as a governance tool. She also explores the roles of rigor, standardization, and interpretability in testing and what’s next for Microsoft’s AI governance work.

Show notes: https://www.microsoft.com/en-us/research/podcast/ai-testing-and-evaluation-reflections/

...more
View all episodesView all episodes
Download on the App Store

Microsoft Research PodcastBy Researchers across the Microsoft research community

  • 4.8
  • 4.8
  • 4.8
  • 4.8
  • 4.8

4.8

80 ratings


More shows like Microsoft Research Podcast

View all
The Daily by The New York Times

The Daily

113,035 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

555 Listeners

Hard Fork by The New York Times

Hard Fork

5,555 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

142 Listeners