August 27, 2025

Behind the scenes of Google's state-of-the-art "nano-banana" image model

30 minutes

Join host Logan Kilpatrick in discussion with some of the minds behind Google's new state-of-the-art image model, Gemini 2.5 Flash Image. Product and research leads from the Gemini team break down the technology behind its key capabilities, including interleaved generation for complex edits and new approaches to achieving character consistency and pixel-perfect control. With Nicole Brichtova, Kaushik Shivakumar, Mostafa Dehghani and Robert Riachi.

Watch on YouTube:

Chapters:
0:37 - New model introduction
1:21 - Demo - Image Editing
3:44 - Text rendering capabilities
4:44 - Beyond human preference evals
6:44 - Text rendering as a proxy for quality
8:38 - Positive transfer between modalities
11:25 - Demo - Multi-turn, context-aware image generation
13:54 - Pixel-perfect editing and character consistency
15:51 - Interleaved image generation
17:59 - Specialized vs. native models
19:52 - Understanding nuanced prompts
20:59 - User feedback shaping model development
22:37 - Improvements in character consistency
24:17 - More natural-looking images from team collaboration
26:41 - What’s next for image generation models

Google AI: Release Notes

By Google AI

Rating: 5 (66 ratings)

More shows like Google AI: Release Notes

a16z Podcast by Andreessen Horowitz (1,091 listeners)
NVIDIA AI Podcast by NVIDIA (341 listeners)
Y Combinator Startup Podcast by Y Combinator (236 listeners)
Practical AI by Practical AI LLC (212 listeners)
Google DeepMind: The Podcast by Hannah Fry (197 listeners)
Big Technology Podcast by Alex Kantrowitz (476 listeners)
No Priors: Artificial Intelligence | Technology | Startups by Conviction (131 listeners)
Latent Space: The AI Engineer Podcast by swyx + Alessio (95 listeners)
This Day in AI Podcast by Michael Sharkey, Chris Sharkey (210 listeners)
The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore (588 listeners)
AI For Humans: Making Artificial Intelligence Fun & Practical by Kevin Pereira & Gavin Purcell (268 listeners)
Lightcone Podcast by Y Combinator (22 listeners)
Training Data by Sequoia Capital (39 listeners)
OpenAI Podcast by OpenAI (52 listeners)
Cheeky Pint by Stripe (43 listeners)