
Sign up to save your podcasts
Or


Ever wondered if AI leaderboards are as fair as they seem? In today's episode of A Beginner's Guide to AI, Professor GePhardT digs into the explosive controversy known as the "Leaderboard Illusion."
With the crowdsourced AI ranking platform LM Arena securing a jaw-dropping $100 million investment and skyrocketing its valuation to $600 million, AI evaluation has become serious business.
But recent revelations accuse big players like Meta of secretly gaming the rankings—testing multiple hidden versions of their AI models to artificially inflate their scores.
Join us as we unravel this juicy saga, discuss why fairness and transparency matter, and explore how these leaderboards can impact the AI landscape.
Tune in to get my thoughts, don't forget to subscribe to our Newsletter!
Want to get in contact? Write me an email: [email protected]
This podcast was generated with the help of ChatGPT and Mistral . We do fact-check with human eyes, but there still might be hallucinations in the output. And, by the way, it's read by an AI voice from ElevenLabs.
Music credit: "Modern Situations" by Unicorn Heads
By Dietmar Fischer3.1
5050 ratings
Ever wondered if AI leaderboards are as fair as they seem? In today's episode of A Beginner's Guide to AI, Professor GePhardT digs into the explosive controversy known as the "Leaderboard Illusion."
With the crowdsourced AI ranking platform LM Arena securing a jaw-dropping $100 million investment and skyrocketing its valuation to $600 million, AI evaluation has become serious business.
But recent revelations accuse big players like Meta of secretly gaming the rankings—testing multiple hidden versions of their AI models to artificially inflate their scores.
Join us as we unravel this juicy saga, discuss why fairness and transparency matter, and explore how these leaderboards can impact the AI landscape.
Tune in to get my thoughts, don't forget to subscribe to our Newsletter!
Want to get in contact? Write me an email: [email protected]
This podcast was generated with the help of ChatGPT and Mistral . We do fact-check with human eyes, but there still might be hallucinations in the output. And, by the way, it's read by an AI voice from ElevenLabs.
Music credit: "Modern Situations" by Unicorn Heads

334 Listeners

152 Listeners

207 Listeners

110 Listeners

154 Listeners

227 Listeners

608 Listeners

275 Listeners

107 Listeners

173 Listeners

55 Listeners

49 Listeners

146 Listeners

62 Listeners

24 Listeners