
Sign up to save your podcasts
Or


In this episode, Caterina Constantinescu dives deep into Large Language Models (LLMs), spotlighting top leaderboards, evaluation benchmarks, and real-world user perceptions. Plus, discover the challenges of dataset contamination and the intricacies of platforms like HELM and Chatbot Arena.
Additional materials: www.superdatascience.com/706
Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
 By Jon Krohn
By Jon Krohn4.6
294294 ratings
In this episode, Caterina Constantinescu dives deep into Large Language Models (LLMs), spotlighting top leaderboards, evaluation benchmarks, and real-world user perceptions. Plus, discover the challenges of dataset contamination and the intricacies of platforms like HELM and Chatbot Arena.
Additional materials: www.superdatascience.com/706
Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

475 Listeners

1,084 Listeners

339 Listeners

769 Listeners

156 Listeners

268 Listeners

210 Listeners

141 Listeners

90 Listeners

132 Listeners

151 Listeners

208 Listeners

562 Listeners

265 Listeners

70 Listeners