Google AI: Release Notes

How a Moonshot Led to Google DeepMind's Veo 3


Listen Later

Dumi Erhan, co-lead of the Veo project at Google DeepMind, joins host Logan Kilpatrick for a deep dive into the evolution of generative video models. They discuss the journey from early research in 2018 to the launch of state-of-the-art Veo 3 model with native audio generation. Learn about the technical hurdles in evaluating and scaling video models, the challenges of long-duration video coherence and how user feedback is shaping the future of AI-powered video creation.

Chapter: 
0:00 - Intro
0:47 - Veo project's beginnings
3:02 - Veo's origins in Google Brain
5:07 - Video prediction and robotics applications
7:45 - Early progress and evaluation challenges
10:30 - Physics-based evaluations and their limitations
12:18 - The launch of the original Veo model
14:06 - Scaling challenges for video models
16:02 - The leap from Veo1 to Veo2
19:40 - Veo 3’s viral audio moment
21:17 - User trends shaping Veo's roadmap
23:49 - Image-to-video vs. text-to-video complexity
26:00 - New prompting methods and user control
27:55 - Coherence in long video generation
31:03 - Genie 3 and world models
35:54 - The steerability challenge
41:59 - Capability transfer and image data's role
47:25 - Closing

 

...more
View all episodesView all episodes
Download on the App Store

Google AI: Release NotesBy Google AI

  • 5
  • 5
  • 5
  • 5
  • 5

5

7 ratings


More shows like Google AI: Release Notes

View all
Odd Lots by Bloomberg

Odd Lots

1,989 Listeners

The a16z Show by Andreessen Horowitz

The a16z Show

1,096 Listeners

Y Combinator Startup Podcast by Y Combinator

Y Combinator Startup Podcast

234 Listeners

The Diary Of A CEO with Steven Bartlett by DOAC

The Diary Of A CEO with Steven Bartlett

8,748 Listeners

The Best One Yet by Nick & Jack Studios

The Best One Yet

9,718 Listeners

Google DeepMind: The Podcast by Hannah Fry

Google DeepMind: The Podcast

201 Listeners

All-In with Chamath, Jason, Sacks & Friedberg by All-In Podcast, LLC

All-In with Chamath, Jason, Sacks & Friedberg

10,204 Listeners

Hard Fork by The New York Times

Hard Fork

5,587 Listeners

Call Me Back - with Dan Senor by Ark Media, Ilan Benatar

Call Me Back - with Dan Senor

3,287 Listeners

The Ezra Klein Show by New York Times Opinion

The Ezra Klein Show

16,487 Listeners

The Artificial Intelligence Show by Paul Roetzer and Mike Kaput

The Artificial Intelligence Show

208 Listeners

The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief: Artificial Intelligence News and Analysis

675 Listeners

The Markets by Goldman Sachs

The Markets

86 Listeners

Everyday AI Podcast – An AI and ChatGPT Podcast by Everyday AI

Everyday AI Podcast – An AI and ChatGPT Podcast

110 Listeners

AI + a16z by a16z

AI + a16z

32 Listeners