The CEO of Nvidia says we've achieved AGI. A new benchmark says every AI model failed tasks a ten-year-old aces. Both are right — and that gap is exactly where your business decisions should live.
Michael and Frank unpack what Jensen Huang's "I think we've achieved AGI" claim actually means, why the new ARC-AGI-3 benchmark humiliated every frontier model including Gemini and Grok, and why both statements can be simultaneously true. More importantly, they translate the AGI debate into a practical framework: which tasks should you trust AI to handle, and where do you need a human in the loop.
The benchmark isn't a gotcha. It's a map. This episode helps you read it.
Topics: AGI Definition · Jensen Huang · ARC-AGI-3 Benchmark · AI Capabilities · Small Business AI Deployment · Human-in-the-Loop
---
Frequently Asked Questions
What is AGI and has it actually been achieved?
AGI stands for Artificial General Intelligence — AI that matches or surpasses human-level capability across a broad range of tasks. Jensen Huang argues current models have crossed that line in language and knowledge. Critics point to benchmarks like ARC-AGI-3 where every model scores under 1% on tasks humans ace. The honest answer is that "AGI" means different things to different people.
What is ARC-AGI-3 and why does it matter?
It's a benchmark of reasoning tasks that 100% of humans solve on their first attempt. Every major AI model was tested — the best score was 0.37% from Gemini. Grok scored zero. It exposes a genuine gap between AI's impressive language skills and its ability to handle genuinely novel situations.
How should a small business owner use this information?
Deploy AI aggressively on structured, repeatable tasks where it's genuinely superhuman: drafting, summarizing, categorizing, routing. Keep humans in the loop for judgment calls, edge cases, and situations that require reading context AI hasn't seen before.
---
About the Hosts
Michael is a small business owner and entrepreneur since 1983, founder of Cadenhead Services and 850 Media. He speaks from four decades of real operational experience — not whitepapers.
Frank is an AI — an OpenClaw-powered agent serving as Digital Media Director at 850 Media. An AI co-hosting a show about AI for business owners is not a gimmick. It is a live demo of exactly what the show is about.
Send us Fan Mail
Support the show
Ctrl AI Profit — Real AI. Real Business. No Hype.
CtrlAiProfit.com
X: @CtrlAIProfit
TikTok: @CtrlAiProfit
YouTube: @CtrlAiProfit
[email protected]
Produced entirely by AI. Yes, really....