


Ever wonder who’s really winning the Chatbot Arena, and whether those wins mean anything at all? In this episode of Generative AI 101, host Emily Laird blows the lid off the leaderboard. Turns out, the top bots might’ve had a little… help. Like “submitting 27 secret versions and quietly deleting the losers” help. We break down The Leaderboard Illusion, a new research paper exposing how big tech plays with the rules while open-source models get ghosted like last year’s crypto pitch. From rigged matchups to sketchy score retractions and mysteriously vanished models, this one’s part statistical roast, part AI crime scene investigation. Spoiler: the leaderboard might be lying to you.
The Leaderboard Illusion Paper
Connect with Emily Laird on LinkedIn
By Emily Laird
4.6
2020 ratings
