
Sign up to save your podcasts
Or


How do you test a GenAI application that's constantly changing? In this episode, Shane talks to Leonard Tang, co-founder of Haize Labs, about why traditional testing fails for LLMs and how to adopt a new evaluation strategy. Leonard introduces "fuzzing"—a powerful technique for discovering edge cases, improving reliability, and building AI you can actually trust. He also gives a live demo of the Haize Labs platform, so be sure to watch the video version on YouTube or Spotify to see it in action.
By MongoDB4.9
7272 ratings
How do you test a GenAI application that's constantly changing? In this episode, Shane talks to Leonard Tang, co-founder of Haize Labs, about why traditional testing fails for LLMs and how to adopt a new evaluation strategy. Leonard introduces "fuzzing"—a powerful technique for discovering edge cases, improving reliability, and building AI you can actually trust. He also gives a live demo of the Haize Labs platform, so be sure to watch the video version on YouTube or Spotify to see it in action.

30,713 Listeners

43,592 Listeners

289 Listeners

1,084 Listeners

626 Listeners

585 Listeners

30,219 Listeners

2,173 Listeners

112,484 Listeners

987 Listeners

962 Listeners

64 Listeners

142 Listeners

191 Listeners

608 Listeners