Sign up to save your podcastsEmail addressPasswordRegisterOrContinue with GoogleAlready have an account? Log in here.
March 05, 2026Humanity's Last Exam: The Test AI Keeps Failing40 minutesPlay2,500 questions no AI can Google. GPT-4o scored 2.7%, humans hit 90%. Inside the hardest AI benchmark and its 30% error rate....moreShareView all episodesBy SigmaticMarch 05, 2026Humanity's Last Exam: The Test AI Keeps Failing40 minutesPlay2,500 questions no AI can Google. GPT-4o scored 2.7%, humans hit 90%. Inside the hardest AI benchmark and its 30% error rate....more
2,500 questions no AI can Google. GPT-4o scored 2.7%, humans hit 90%. Inside the hardest AI benchmark and its 30% error rate.
March 05, 2026Humanity's Last Exam: The Test AI Keeps Failing40 minutesPlay2,500 questions no AI can Google. GPT-4o scored 2.7%, humans hit 90%. Inside the hardest AI benchmark and its 30% error rate....more
2,500 questions no AI can Google. GPT-4o scored 2.7%, humans hit 90%. Inside the hardest AI benchmark and its 30% error rate.