AI Deep Dive

63: When AI Builds 3D Worlds, Personalizes Everything, and Then the Bill Arrives


Listen Later

The AI news cycle has split into three simultaneous revolutions: persistent 3D world models, hyper-personalized LLMs, and an infrastructure arms race that’s costing billions. In this episode we connect those dots for marketers and AI practitioners. We unpack Fei-Fei Li’s World Labs and Marble, an editable 3D environment generator that creates persistent scenes from text, images, video or existing layouts and exports as Gaussian splats, meshes or video—unlocking fast imports to game, VFX, VR, robotics training and architectural visualization. We explain why Gaussian splats matter for real-world speed and workflow integration.
Then we shift to personalization: OpenAI’s GPT‑5.1 focuses on steerability over headline-busting benchmarks with Instant and Thinking flavors plus eight personality presets (Default, Professional, Friendly, Candid, Quirky, Efficient, Nerdy, Cynical) and tunings for emoji and warmth—making models feel like branded collaborators. Against that, Baidu’s open-source Ernie 4.5 VL28B shows efficiency can beat brute force: a 28B model that sparsely activates ~3B parameters and dynamically “thinks with images,” proving cost-efficient architectures can undercut scale-for-scale approaches.
All of this runs on massive compute. OpenAI reportedly spent $5.02B on Azure in H1 2025 for inference alone; Anthropic is planning a $50B U.S. infrastructure build; Microsoft is doubling data center capacity with million+ square-foot facilities filled with hundreds of thousands of GPUs. The legal layer is heating up too: a judge ordered 20 million anonymized ChatGPT conversations to the New York Times (an earlier request sought 1.4B), spotlighting tensions between discovery and user confidentiality.
We finish with practical playbooks: how to use ChatGPT Projects for private new-hire onboarding (sample kickoff prompt that forces clarifying questions), and an elegant Zapier-agent workflow from a data manager that creates tiny report-specific AIs routed by a classifier so marketing gets verified, page-level answers in seconds. The takeaway: AI is rapidly becoming multimodal, persistent and personalized—but the competition is now about efficiency and cost, and a paradox remains. Experts expect benchmarks to match or beat humans by 2027–28, yet long-tail reliability failures will likely keep everyday tasks brittle through 2029. For marketers and builders, the imperative is clear: adopt spatial and personalization tools now, design for long-tail failure modes, and budget for the real cost of keeping these systems running.
...more
View all episodesView all episodes
Download on the App Store

AI Deep DiveBy Pete Larkin