My Weird Prompts

How Do You QA a Probabilistic System?



Traditional unit tests fail for probabilistic LLMs. We break down the modern toolkit for automated quality evaluation, from heuristic safety nets to LLM-as-judge grading. Learn how to catch hallucinations, manage bias, and build a manufacturing line for intelligence that actually scales.

My Weird Prompts, by Daniel Rosehill