.NET Rocks!

Measuring LLMs with Jodie Burchell


Listen Later

How do you measure the quality of a large language model? Carl and Richard talk to Dr. Jodie Burchell about her work measuring large language models for accuracy, reliability, and consistency. Jodie talks about the variety of benchmarks that exist for LLMs and the problems they have. A broader conversation about quality digs into the idea that LLMs should be targeted to the particular topic area they are being used for - often, smaller is better! Building a good test suite for your LLM is challenging but can increase your confidence that the tool will work as expected.
...more
View all episodesView all episodes
Download on the App Store

.NET Rocks!By Carl Franklin and Richard Campbell

  • 4.6
  • 4.6
  • 4.6
  • 4.6
  • 4.6

4.6

242 ratings


More shows like .NET Rocks!

View all
Hanselminutes with Scott Hanselman by Scott Hanselman

Hanselminutes with Scott Hanselman

380 Listeners

.NET Rocks! by Carl Franklin and Richard Campbell

.NET Rocks!

38 Listeners

The Changelog: Software Development, Open Source by Changelog Media

The Changelog: Software Development, Open Source

288 Listeners

Security Now (Audio) by TWiT

Security Now (Audio)

2,010 Listeners

MacBreak Weekly (Audio) by TWiT

MacBreak Weekly (Audio)

2,012 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

630 Listeners

RunAs Radio by Richard Campbell

RunAs Radio

83 Listeners

Talk Python To Me by Michael Kennedy

Talk Python To Me

583 Listeners

Merge Conflict by soundbite.fm

Merge Conflict

86 Listeners

Syntax - Tasty Web Development Treats by Wes Bos & Scott Tolinski - Full Stack JavaScript Web Developers

Syntax - Tasty Web Development Treats

990 Listeners

Tech Brew Ride Home by Morning Brew

Tech Brew Ride Home

970 Listeners

Practical AI by Practical AI LLC

Practical AI

211 Listeners

Azure & DevOps Podcast by Jeffrey Palermo

Azure & DevOps Podcast

22 Listeners

The Unhandled Exception Podcast by Dan Clarke (@dracan)

The Unhandled Exception Podcast

3 Listeners

Security This Week by Carl Franklin

Security This Week

18 Listeners

The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief: Artificial Intelligence News and Analysis

681 Listeners