Why Data Leakage Threatens AI Benchmarks—and How We're Fixing It. Discover the future of reliable model evaluation.
In this episode, we dive deep into the challenges facing time series AI model leaderboards, from hidden information leakage to the complexities of benchmarking foundation models. I sit down with Marcel Meyer to unpack why traditional approaches fall short and how our new TS Arena leaderboard is setting a new standard for fair, future-proof evaluation.
We explore the pitfalls that plague current benchmarks, the surprising ways data contamination can skew results, and the innovative pre-registration protocol we've developed to keep evaluations honest. If you've ever wondered what it takes to build a truly trustworthy AI leaderboard—or why it matters for industry and research alike—this conversation is packed with insights you won't want to miss.
<a href="https://ts-arena.live/">https://ts-arena.live/</a>

Why Data Leakage Threatens AI Benchmarks—and How We're Fixing It. Discover the future of reliable model evaluation. In this episode, we dive deep into the challenges facing time series AI model leaderboards, from hidden information leakage to the complexities of benchmarking foundation models. I sit down with Marcel Meyer to unpack why traditional approaches fall short and how our new TS Arena leaderboard is setting a new standard for fair, future-proof evaluation. We explore the pitfalls that plague current benchmarks, the surprising ways data contamination can skew results, and the innovative pre-registration protocol we've developed to keep evaluations honest. If you've ever wondered what it takes to build a truly trustworthy AI leaderboard—or why it matters for industry and research alike—this conversation is packed with insights you won't want to miss. https://ts-arena.live/

Why Data Leakage Threatens AI Benchmarks—and How We're Fixing It. Discover the future of reliable model evaluation.
In this episode, we dive deep into the challenges facing time series AI model leaderboards, from hidden information leakage to the complexities of benchmarking foundation models. I sit down with Marcel Meyer to unpack why traditional approaches fall short and how our new TS Arena leaderboard is setting a new standard for fair, future-proof evaluation.
We explore the pitfalls that plague current benchmarks, the surprising ways data contamination can skew results, and the innovative pre-registration protocol we've developed to keep evaluations honest. If you've ever wondered what it takes to build a truly trustworthy AI leaderboard—or why it matters for industry and research alike—this conversation is packed with insights you won't want to miss.
<a href="https://ts-arena.live/" rel="noopener noreferrer">https://ts-arena.live/</a>

The End of Data Leakage

The Industrial AI Podcast reports weekly on the latest developments in AI and machine learning for the engineering, robotics, automotive, process and automation industries. The podcast features industrial users, scientists, vendors and startups in the field of Industrial AI and machine learning. The podcast is hosted by Peter Seeberg, Industrial AI consultant and Robert Weber, tech journalist.Their mission: Demystify Industrial AI and machine learning, inspire industrial users.

The hosts: 

Peter Seeberg is an Industrial AI and machine learning expert for the manufacturing industry. He worked over 25 years in IT (Intel) and 10 years in Automation. He co-initiated the Industrial Data Intelligence Startup (Softing) where he was responsible for managing machine learning projects in industrial environments. Today he advises companies when it comes to Industrial AI and machine learning. Together with Robert Weber, journalist for industrial topics, he discusses AI and ML applications, standards, and education topics, make or buy decisions as well as regulation for AI in manufacturing.

Business

Science

Technology

Management

Natural Sciences

The Industrial AI Podcast reports weekly on the latest developments in AI and machine learning for the engineering, robotics, automotive, process and automation industries. The podcast features industrial users, scientists, vendors and startups in the field of Industrial AI and machine learning. The podcast is hosted by Peter Seeberg, Industrial AI consultant and Robert Weber, tech journalist.Their mission: Demystify Industrial AI and machine learning, inspire industrial users. The hosts: Peter Seeberg is an Industrial AI and machine learning expert for the manufacturing industry. He worked over 25 years in IT (Intel) and 10 years in Automation. He co-initiated the Industrial Data Intelligence Startup (Softing) where he was responsible for managing machine learning projects in industrial environments. Today he advises companies when it comes to Industrial AI and machine learning. Together with Robert Weber, journalist for industrial topics, he discusses AI and ML applications, standards, and education topics, make or buy decisions as well as regulation for AI in manufacturing.

Share The End of Data Leakage

Sign up to save your podcasts

The End of Data Leakage

The End of Data Leakage

More shows like Industrial AI Podcast

Streitkräfte und Strategien

OMR Podcast

Super Data Science: ML & AI Podcast with Jon Krohn

Was jetzt?

Spektrum-Podcast

Robotik in der Industrie

Lanz + Precht

F.A.Z. Frühdenker - Die Nachrichten am Morgen

Open Source in der Industrie

Christian Wolf & Eric Demuth