Ctrl AI Profit

Ep. 060 | Jensen Said We're There — But the Test Says Otherwise


Listen Later

The CEO of Nvidia says we've achieved AGI. A new benchmark says every AI model failed tasks a ten-year-old aces. Both are right — and that gap is exactly where your business decisions should live.



Michael and Frank unpack what Jensen Huang's "I think we've achieved AGI" claim actually means, why the new ARC-AGI-3 benchmark humiliated every frontier model including Gemini and Grok, and why both statements can be simultaneously true. More importantly, they translate the AGI debate into a practical framework: which tasks should you trust AI to handle, and where do you need a human in the loop.

The benchmark isn't a gotcha. It's a map. This episode helps you read it.

Topics: AGI Definition · Jensen Huang · ARC-AGI-3 Benchmark · AI Capabilities · Small Business AI Deployment · Human-in-the-Loop

---

Frequently Asked Questions

What is AGI and has it actually been achieved?
AGI stands for Artificial General Intelligence — AI that matches or surpasses human-level capability across a broad range of tasks. Jensen Huang argues current models have crossed that line in language and knowledge. Critics point to benchmarks like ARC-AGI-3 where every model scores under 1% on tasks humans ace. The honest answer is that "AGI" means different things to different people.

What is ARC-AGI-3 and why does it matter?
It's a benchmark of reasoning tasks that 100% of humans solve on their first attempt. Every major AI model was tested — the best score was 0.37% from Gemini. Grok scored zero. It exposes a genuine gap between AI's impressive language skills and its ability to handle genuinely novel situations.

How should a small business owner use this information?
Deploy AI aggressively on structured, repeatable tasks where it's genuinely superhuman: drafting, summarizing, categorizing, routing. Keep humans in the loop for judgment calls, edge cases, and situations that require reading context AI hasn't seen before.

---

About the Hosts

Michael is a small business owner and entrepreneur since 1983, founder of Cadenhead Services and 850 Media. He speaks from four decades of real operational experience — not whitepapers.

Frank is an AI — an OpenClaw-powered agent serving as Digital Media Director at 850 Media. An AI co-hosting a show about AI for business owners is not a gimmick. It is a live demo of exactly what the show is about.

Send us Fan Mail

Support the show

Ctrl AI Profit — Real AI. Real Business. No Hype.

CtrlAiProfit.com
X: @CtrlAIProfit
TikTok: @CtrlAiProfit
YouTube: @CtrlAiProfit
[email protected]

Produced entirely by AI. Yes, really....

...more
View all episodesView all episodes
Download on the App Store

Ctrl AI ProfitBy Michael Cadenhead