FutureSim: Replaying Real-World Events to Evaluate AI Forecasting Agents
By Shaoqing Tan
May 24, 2026, 27 minutes
A benchmark designed to test AI models' capabilities in making accurate 3-month future predictions.