The episode opens with the hosts talking about new live multimodal AI features in ChatGPT and Google Gemini, including Andrew's demo of showing ChatGPT a card trick over live video. They note that these features had been demonstrated earlier and are now shipping, but emphasize that backend compute, server connections, and GPU supply make rollout slower than some people expect. Most of the episode is spent on OpenAI's Sora and other video generators. The hosts discuss how to use Sora, including starting from a strong image or uploaded video, using storyboards, keeping generations short, trying lower resolutions first, using remix tools, and learning from the featured/recent feeds. They repeatedly stress current limitations in physical reasoning, object relationships, and variable binding, while also praising Sora for b-roll, companion footage, character coherence, and other creative uses. The episode closes with a short TV-picks segment covering Lower Decks, Strange New Worlds, Foundation, and Skeleton Crew. Key topics Infrastructure limits behind live AI features: Andrew says live video and multimodal AI depend on significant backend compute and connections, so they are not just app updates and cannot be scaled instantly. AI as a supportive audience member: In the live card-trick demo, the hosts are struck by how ChatGPT behaves like an encouraging spectator, staying supportive rather than immediately correcting the trick. Using prompts, storyboards, and reference images in S