
Sign up to save your podcasts
Or


Ever wonder who’s really winning the Chatbot Arena and whether those wins mean anything at all? In this episode of Generative AI 101, host Emily Laird's blowing the lid off the leaderboard. Turns out, the top bots might’ve had a little… help. Like submitting 27 secret versions and quietly deleting the losers help. We break down The Leaderboard Illusion, a new research paper, is exposing how big tech plays with the rules, while open-source models get ghosted like last year’s crypto pitch. From rigged matchups to sketchy score retractions and mysteriously vanished models, this one’s part statistical roast, part AI crime scene investigation. Spoiler: the leaderboard might be lying to you.
The Leaderboard Illusion Paper
Connect with Emily Laird on LinkedIn
By Emily Laird4.6
1919 ratings
Ever wonder who’s really winning the Chatbot Arena and whether those wins mean anything at all? In this episode of Generative AI 101, host Emily Laird's blowing the lid off the leaderboard. Turns out, the top bots might’ve had a little… help. Like submitting 27 secret versions and quietly deleting the losers help. We break down The Leaderboard Illusion, a new research paper, is exposing how big tech plays with the rules, while open-source models get ghosted like last year’s crypto pitch. From rigged matchups to sketchy score retractions and mysteriously vanished models, this one’s part statistical roast, part AI crime scene investigation. Spoiler: the leaderboard might be lying to you.
The Leaderboard Illusion Paper
Connect with Emily Laird on LinkedIn

333 Listeners

152 Listeners

211 Listeners

197 Listeners

154 Listeners

227 Listeners

610 Listeners

274 Listeners

106 Listeners

54 Listeners

173 Listeners

57 Listeners

146 Listeners

62 Listeners

24 Listeners