.NET Rocks!

Measuring LLMs with Jodie Burchell


Listen Later

How do you measure the quality of a large language model? Carl and Richard talk to Dr. Jodie Burchell about her work measuring large language models for accuracy, reliability, and consistency. Jodie talks about the variety of benchmarks that exist for LLMs and the problems they have. A broader conversation about quality digs into the idea that LLMs should be targeted to the particular topic area they are being used for - often, smaller is better! Building a good test suite for your LLM is challenging but can increase your confidence that the tool will work as expected.
...more
View all episodesView all episodes
Download on the App Store

.NET Rocks!By Carl Franklin

  • 5
  • 5
  • 5
  • 5
  • 5

5

5 ratings


More shows like .NET Rocks!

View all
Software Engineering Radio by se-radio@computer.org

Software Engineering Radio

273 Listeners

Hanselminutes with Scott Hanselman by Scott Hanselman

Hanselminutes with Scott Hanselman

380 Listeners

The Changelog: Software Development, Open Source by Changelog Media

The Changelog: Software Development, Open Source

290 Listeners

The Infinite Monkey Cage by BBC Radio 4

The Infinite Monkey Cage

1,931 Listeners

SpaceTime: Your Guide to Space & Astronomy by Stuart Gary

SpaceTime: Your Guide to Space & Astronomy

298 Listeners

a16z Podcast by Andreessen Horowitz

a16z Podcast

1,087 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

625 Listeners

LINUX Unplugged by Jupiter Broadcasting

LINUX Unplugged

265 Listeners

Syntax - Tasty Web Development Treats by Wes Bos & Scott Tolinski - Full Stack JavaScript Web Developers

Syntax - Tasty Web Development Treats

986 Listeners

REWORK by 37signals

REWORK

210 Listeners

CoRecursive: Coding Stories by Adam Gordon Bell - Software Developer

CoRecursive: Coding Stories

188 Listeners

.NET Rocks! by Carl Franklin and Richard Campbell

.NET Rocks!

245 Listeners

The Stack Overflow Podcast by The Stack Overflow Podcast

The Stack Overflow Podcast

62 Listeners

Big Technology Podcast by Alex Kantrowitz

Big Technology Podcast

476 Listeners

The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief: Artificial Intelligence News and Analysis

561 Listeners