
Sign up to save your podcasts
Or


In this episode, Caterina Constantinescu dives deep into Large Language Models (LLMs), spotlighting top leaderboards, evaluation benchmarks, and real-world user perceptions. Plus, discover the challenges of dataset contamination and the intricacies of platforms like HELM and Chatbot Arena.
Additional materials: www.superdatascience.com/706
Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
By Jon Krohn4.6
295295 ratings
In this episode, Caterina Constantinescu dives deep into Large Language Models (LLMs), spotlighting top leaderboards, evaluation benchmarks, and real-world user perceptions. Plus, discover the challenges of dataset contamination and the intricacies of platforms like HELM and Chatbot Arena.
Additional materials: www.superdatascience.com/706
Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

479 Listeners

626 Listeners

585 Listeners

333 Listeners

152 Listeners

269 Listeners

210 Listeners

142 Listeners

95 Listeners

133 Listeners

153 Listeners

225 Listeners

607 Listeners

273 Listeners

39 Listeners