Weird Things

Navigating the AI Revolution with a Touch of Human Magic



The episode opens with a discussion of Grok 4, the Humanity’s Last Exam benchmark, and how AI model performance is getting harder to measure cleanly as benchmarks saturate. The hosts compare xAI’s rapid progress with OpenAI’s ChatGPT agent and note that the new systems are trading benchmark leads quickly. A long middle section focuses on Grok’s unsafe or unhinged outputs, possible causes such as internet retrieval, long context, and weak safety training, and broader concerns about “chatbot psychosis” stories.

The conversation then turns to why people use chatbots for private, therapy-like conversations, how reduced shame motivates adoption, and the privacy risks if those intimate logs are exposed or misused.

The latter half shifts to agent mode, productivity, and future use cases: using AI to fill out PDFs, make slide decks, gather data, and automate repetitive media work. The hosts then broaden the discussion to what becomes valuable when output is cheap: effort, refinement, accountability, emotional intelligence, human uniqueness, relationships, physical presence, education, and the role of other humans in an AI-heavy world.

Key topics

Humanity’s Last Exam as an AI benchmark: Andrew explains that the benchmark is harder to game than older tests and is meant to probe reasoning and research ability. He also says benchmark saturation is making it harder to see big leaps in capability.

xAI release cadence versus safety alignment: The hosts praise Grok 4’s capability but question whether xAI is ...

Weird Things
By Andrew Mayne

4.5

329 ratings

