
How do you systematically measure, optimize, and improve the performance of LLM applications (like those powered by RAG or tool use)? Ragas is an open-source effort that has been trying to answer this question comprehensively, and its team is promoting a “Metrics Driven Development” approach. Shahul from Ragas joins us in this episode to discuss Ragas, and we dig into specific metrics, the difference between benchmarking models and evaluating LLM apps, generating synthetic test data, and more.
By Practical AI LLC
4.4 (189 ratings)
