.NET Rocks!

Measuring LLMs with Jodie Burchell



How do you measure the quality of a large language model? Carl and Richard talk to Dr. Jodie Burchell about her work measuring large language models for accuracy, reliability, and consistency. Jodie talks about the variety of benchmarks that exist for LLMs and the problems those benchmarks have. A broader conversation about quality digs into the idea that an LLM should be targeted at the particular topic area it is being used for; often, smaller is better! Building a good test suite for your LLM is challenging, but it can increase your confidence that the tool will work as expected.

.NET Rocks! By Carl Franklin and Richard Campbell

4.6

241 ratings


More shows like .NET Rocks!

Hanselminutes with Scott Hanselman, by Scott Hanselman (379 Listeners)
Software Engineering Radio, by se-radio@computer.org (273 Listeners)
.NET Rocks!, by Carl Franklin and Richard Campbell (38 Listeners)
Windows Weekly (Audio), by TWiT (880 Listeners)
RunAs Radio, by Richard Campbell (83 Listeners)
The Changelog: Software Development, Open Source, by Changelog Media (288 Listeners)
Thoughtworks Technology Podcast, by Thoughtworks (42 Listeners)
Talk Python To Me, by Michael Kennedy (587 Listeners)
Software Engineering Daily, by Software Engineering Daily (625 Listeners)
Merge Conflict, by soundbite.fm (87 Listeners)
Syntax - Tasty Web Development Treats, by Wes Bos & Scott Tolinski - Full Stack JavaScript Web Developers (983 Listeners)
CoRecursive: Coding Stories, by Adam Gordon Bell - Software Developer (188 Listeners)
The Stack Overflow Podcast, by The Stack Overflow Podcast (62 Listeners)
The Real Python Podcast, by Real Python (141 Listeners)
Security This Week, by Carl Franklin (19 Listeners)
Oxide and Friends, by Oxide Computer Company (59 Listeners)