Models & Agents
OpenAI ships three specialized realtime audio models for voice agents, translation, and transcription.
What You Need to Know: OpenAI released GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper through the Realtime API, targeting live voice reasoning, 70+ language translation, and streaming transcription. This builds directly on earlier voice mode limitations that Simon Willison noted still feel dated. ...
AI Disclosure: This podcast is curated by Patrick but uses AI-generated voice synthesis for audio production.