Snacks Weekly on Data Science

Evaluating LLM-based chatbot performance [Microsoft]


In this episode, we explore why evaluating LLM-based chatbots is critical for businesses, where traditional evaluation methods fall short, and what a robust evaluation framework covering both search performance and LLM-specific metrics could look like.
For more details, see Microsoft's published tech blog post: https://medium.com/data-science-at-microsoft/evaluating-llm-based-chatbots-a-comprehensive-guide-to-performance-metrics-9c2388556d3e

By Pan Wu

