Google AI: Release Notes

Gemini's Multimodality


Listen Later

Ani Baddepudi, Gemini Model Behavior Product Lead, joins host Logan Kilpatrick for a deep dive into Gemini's multimodal capabilities. Their conversation explores why Gemini was built as a natively multimodal model from day one, the future of proactive AI assistants, and how we are moving towards a world where "everything is vision." Learn about the differences between video and image understanding and token representations, higher FPS video sampling, and more.

 

Chapters:

0:00 - Intro
1:12 - Why Gemini is natively multimodal
2:23 - The technology behind multimodal models
5:15 - Video understanding with Gemini 2.5
9:25 - Deciding what to build next
13:23 - Building new product experiences with multimodal AI
17:15 - The vision for proactive assistants
24:13 - Improving video usability with variable FPS and frame tokenization
27:35 - What’s next for Gemini’s multimodal development
31:47 - Deep dive on Gemini’s document understanding capabilities
37:56 - The teamwork and collaboration behind Gemini
40:56 - What’s next with model behavior


Watch on YouTube: https://www.youtube.com/watch?v=K4vXvaRV0dw

...more
View all episodesView all episodes
Download on the App Store

Google AI: Release NotesBy Google AI

  • 5
  • 5
  • 5
  • 5
  • 5

5

7 ratings


More shows like Google AI: Release Notes

View all
Odd Lots by Bloomberg

Odd Lots

1,993 Listeners

The a16z Show by Andreessen Horowitz

The a16z Show

1,105 Listeners

Y Combinator Startup Podcast by Y Combinator

Y Combinator Startup Podcast

233 Listeners

The Diary Of A CEO with Steven Bartlett by DOAC

The Diary Of A CEO with Steven Bartlett

8,876 Listeners

The Best One Yet by Nick & Jack Studios

The Best One Yet

9,722 Listeners

Google DeepMind: The Podcast by Hannah Fry

Google DeepMind: The Podcast

203 Listeners

All-In with Chamath, Jason, Sacks & Friedberg by All-In Podcast, LLC

All-In with Chamath, Jason, Sacks & Friedberg

10,254 Listeners

Hard Fork by The New York Times

Hard Fork

5,576 Listeners

Call Me Back - with Dan Senor by Ark Media

Call Me Back - with Dan Senor

3,333 Listeners

The Ezra Klein Show by New York Times Opinion

The Ezra Klein Show

16,525 Listeners

The Artificial Intelligence Show by Paul Roetzer and Mike Kaput

The Artificial Intelligence Show

214 Listeners

The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief: Artificial Intelligence News and Analysis

688 Listeners

The Markets by Goldman Sachs

The Markets

80 Listeners

Everyday AI Podcast – An AI and ChatGPT Podcast by Everyday AI

Everyday AI Podcast – An AI and ChatGPT Podcast

112 Listeners

AI + a16z by a16z

AI + a16z

34 Listeners