
Ever wondered if AI leaderboards are as fair as they seem? In today's episode of A Beginner's Guide to AI, Professor GePhardT digs into the explosive controversy known as the "Leaderboard Illusion."
With the crowdsourced AI ranking platform LM Arena securing a jaw-dropping $100 million investment and skyrocketing its valuation to $600 million, AI evaluation has become serious business.
But recent revelations accuse big players like Meta of secretly gaming the rankings—testing multiple hidden versions of their AI models to artificially inflate their scores.
Join us as we unravel this juicy saga, discuss why fairness and transparency matter, and explore how these leaderboards can impact the AI landscape.
Tune in to get my thoughts, and don't forget to subscribe to our newsletter!
Want to get in contact? Write me an email: [email protected]
This podcast was generated with the help of ChatGPT and Mistral. We do fact-check with human eyes, but there still might be hallucinations in the output. And, by the way, it's read by an AI voice from ElevenLabs.
Music credit: "Modern Situations" by Unicorn Heads