


How do you test a GenAI application that's constantly changing? In this episode, Shane talks to Leonard Tang, co-founder of Haize Labs, about why traditional testing fails for LLMs and how to adopt a new evaluation strategy. Leonard introduces "fuzzing"—a powerful technique for discovering edge cases, improving reliability, and building AI you can actually trust. He also gives a live demo of the Haize Labs platform, so be sure to watch the video version on YouTube or Spotify to see it in action.
By MongoDB
Rating: 4.9 (7272 ratings)
