Generative AI 101

BrowseComp vs The Bots that Bluff



Can AI actually read the internet, or is it just faking it with confidence? In this high-voltage episode, host Emily Laird cracks open BrowseComp, OpenAI’s benchmark built to test whether web-browsing agents can find facts that are hard to uncover but easy to verify. Humans had two hours per question and still bailed most of the time, so what does it mean when a model claims victory? From compute budgets and canary strings to the rise of multimodal chaos, Emily exposes the difference between sounding right and being right, and why, in an era of polished, source-backed answers, persistence beats plausibility every time.


Join the AI Weekly Meetups

Connect with Us: If you enjoyed this episode or have questions, reach out to Emily Laird on LinkedIn. Stay tuned for more insights into the evolving world of generative AI. And remember, you now know more about the BrowseComp benchmark.


Connect with Emily Laird on LinkedIn


Generative AI 101 by Emily Laird

4.6

20 ratings

