November 21, 2025

Gemini 3: Model Card and Safety Framework Report

24 minutes

Gemini 3 Pro is an excellent model, sir.

This is a frontier model release, so we start by analyzing the model card and safety framework report.

Then later I’ll look at capabilities.

I found the safety framework highly frustrating to read, as it repeatedly ‘hides the football’ and withholds or makes it difficult to understand key information.

I do not believe there is a frontier safety problem with Gemini 3, but (to jump ahead, I’ll go into more detail next time) I do think that the model is seriously misaligned in many ways, optimizing too much towards achieving training objectives. The training objectives can override the actual conversation. This leaves it prone to hallucinations, crafting narratives, glazing and to giving the user what it thinks the user will approve of rather than what is true, what the user actually asked for or would benefit from.

It is very much a Gemini model, perhaps the most Gemini model so far.

Gemini 3 Pro is an excellent model despite these problems, but one must be aware.

Gemini 3 Self-Portrait

Gemini 3 Facts

I already did my ‘Third Gemini’ jokes and I won’t [...]

---

Outline:

(01:26) Gemini 3 Facts

(02:35) On Your Marks

(03:27) Safety Third

(05:18) Frontier Safety Framework

(05:44) CBRN

(08:29) Cybersecurity

(09:47) Manipulation

(14:54) Machine Learning R&D

(16:55) Misalignment

(19:06) Chain of Thought Legibility

(19:25) Safety Mitigations

(21:56) They Close On This Not Troubling At All Note

(22:51) So, Is It Safe?

---

First published:

November 21st, 2025

Source:

https://www.lesswrong.com/posts/5s5NZ6txhHMmSRSNw/gemini-3-model-card-and-safety-framework-report

---

Narrated by TYPE III AUDIO.

---

Images from the article:

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

...more

View all episodes

By zvi

22 ratings