
Sign up to save your podcasts
Or
Join Allen Firstenberg and Linda Lawton of Two Voice Devs as they record live from Google I/O 2025! As the conference neared the end, they dive deep into the groundbreaking announcements in generative AI, discussing the latest advancements and what they mean for developers, especially those in Conversational AI.
This episode explores the new and updated models that are set to redefine content creation:
Lyria: Google's innovative streaming audio generation API, its unique WebSocket-based approach, and the fascinating possibilities (and challenges!) of dynamic music creation, including its potential for YouTube content and the ever-present copyright questions surrounding AI-generated media.
Veo 3: The video generation powerhouse, now enhanced with synchronized audio and voice, realistic lip-sync for characters (yes, even cartoon animals!), and improvements in "world physics." They also tackle the implications of its pricing for professional and individual creators.
Imagen 4: Discover the highly anticipated improvements in text generation within images, including stylized fonts and potential for other languages.
Allen and Linda also share some early creations with these new models.
Whether you're building the next great voice app, creating dynamic content, or just curious about the cutting edge of AI, this episode offers a developer-focused perspective on the future of generative media.
00:00:00: Introduction to Two Voice Devs at I/O 2025
00:00:50: I/O 2025: New Generative AI Models Overview
00:01:20: Lyria: Streaming Audio Generation and Documentation Challenges
00:03:00: Lyria's Practical Use Cases & Generative AI Copyright Questions
00:10:00: Veo 3: Video Generation with Synchronized Audio and Voice Features
00:12:10: Veo 3 Pricing and Cost Implications for Developers
00:14:20: Imagen 4: Improved Text Generation in Images
00:17:40: Professional Use Cases for Veo and Imagen
00:19:10: Flow: The New Professional Studio System for Creators
00:22:00: Gemini Ultra Tiered Pricing and Regional Restrictions
00:24:20: Concluding Thoughts and Call to Action
#GoogleIO2025 #GenerativeAI #AIModels #Lyria #Veo3 #Imagen4 #FlowAI #TwoVoiceDevs #VoiceTech #ConversationalAI #AIDevelopment #MachineLearning #ContentCreation #YouTubeCreators #GoogleAI #VertexAI #GeminiUltra #CopyrightAI #TechPodcast
1
11 ratings
Join Allen Firstenberg and Linda Lawton of Two Voice Devs as they record live from Google I/O 2025! As the conference neared the end, they dive deep into the groundbreaking announcements in generative AI, discussing the latest advancements and what they mean for developers, especially those in Conversational AI.
This episode explores the new and updated models that are set to redefine content creation:
Lyria: Google's innovative streaming audio generation API, its unique WebSocket-based approach, and the fascinating possibilities (and challenges!) of dynamic music creation, including its potential for YouTube content and the ever-present copyright questions surrounding AI-generated media.
Veo 3: The video generation powerhouse, now enhanced with synchronized audio and voice, realistic lip-sync for characters (yes, even cartoon animals!), and improvements in "world physics." They also tackle the implications of its pricing for professional and individual creators.
Imagen 4: Discover the highly anticipated improvements in text generation within images, including stylized fonts and potential for other languages.
Allen and Linda also share some early creations with these new models.
Whether you're building the next great voice app, creating dynamic content, or just curious about the cutting edge of AI, this episode offers a developer-focused perspective on the future of generative media.
00:00:00: Introduction to Two Voice Devs at I/O 2025
00:00:50: I/O 2025: New Generative AI Models Overview
00:01:20: Lyria: Streaming Audio Generation and Documentation Challenges
00:03:00: Lyria's Practical Use Cases & Generative AI Copyright Questions
00:10:00: Veo 3: Video Generation with Synchronized Audio and Voice Features
00:12:10: Veo 3 Pricing and Cost Implications for Developers
00:14:20: Imagen 4: Improved Text Generation in Images
00:17:40: Professional Use Cases for Veo and Imagen
00:19:10: Flow: The New Professional Studio System for Creators
00:22:00: Gemini Ultra Tiered Pricing and Regional Restrictions
00:24:20: Concluding Thoughts and Call to Action
#GoogleIO2025 #GenerativeAI #AIModels #Lyria #Veo3 #Imagen4 #FlowAI #TwoVoiceDevs #VoiceTech #ConversationalAI #AIDevelopment #MachineLearning #ContentCreation #YouTubeCreators #GoogleAI #VertexAI #GeminiUltra #CopyrightAI #TechPodcast
1,332 Listeners
77 Listeners