Andrew and Justin spend most of the episode comparing OpenAI's GPT-4o rollout with Google's AI announcements. They describe GPT-4o as a multimodal system that combines text, image, sound, and voice into one model, and emphasize that OpenAI's live demos felt fast, real-time, and more transparent than Google's earlier staged or prerecorded presentations. They also discuss latency, the shift from separate speech/transcription models to a single model, and how native desktop and mobile apps, along with ChatGPT for Enterprise, fit OpenAI's product strategy. The second half of the conversation broadens into release timing, safety, secrecy, and organizational interpretation. Andrew says OpenAI often holds capabilities back for safety or product timing, that people overread departures and rumors, and that claims about AI hitting a wall are premature given the field's growth. The episode closes with Andrew recommending AMC's Interview with the Vampire and Justin offering OpenAI's YouTube channel as a place to watch the demos. Key topics GPT-4o as a multimodal model: Andrew explains GPT-4o as one model that can handle text, images, sound, and voice together, replacing a pipeline of separate transcription, language, and speech systems. Live demos versus prerecorded demos: The hosts contrast OpenAI's live GPT-4o presentation with Google's earlier Gemini/Astra demos, which Andrew criticizes as controlled, prerecorded, or misleading. Latency and real-time interaction: A recurring point is