
Sign up to save your podcasts
Or


This post goes over the extensive report Google put out on Gemini 1.5.
There are no important surprises. Both Gemini Pro 1.5 and Gemini Flash are ‘highly capable multimodal models incorporating a novel mixture-of-experts architecture’ and various other improvements. They are solid models with solid performance. It can be useful and interesting to go over the details of their strengths and weaknesses.
The biggest thing to know is that Google improves its models incrementally and silently over time, so if you have not used Gemini in months, you might be underestimating what it can do.
I’m hitting send and then jumping on a plane to Berkeley. Perhaps I will see you there over the weekend. That means that if there are mistakes here, I will be slower to respond and correct them than usual, so consider checking the comments section.
Practical Questions First
The [...]
---
Outline:
(00:56) Practical Questions First
(03:51) Speed Kills
(04:44) Very Large Context Windows
(05:14) Relative Performance within the Gemini Family
(07:04) Gemini Flash and the Future Flash-8B
(08:21) New and Improved Evaluations
(14:57) Core Capability Evaluations
(18:14) Model Architecture and Training
(20:08) Safety, Security and Responsibility
(24:45) What Do We Want?
(26:02) Don’t You Know That You’re Toxic?
(28:32) Trying to be Helpful
(29:45) Security Issues
(31:33) Representational Harms
(33:17) Arms-Length Internal Assurance Evaluations
(35:01) External Evaluations
(35:46) Safety Overall
---
First published:
Source:
---
Narrated by TYPE III AUDIO.
By zvi5
22 ratings
This post goes over the extensive report Google put out on Gemini 1.5.
There are no important surprises. Both Gemini Pro 1.5 and Gemini Flash are ‘highly capable multimodal models incorporating a novel mixture-of-experts architecture’ and various other improvements. They are solid models with solid performance. It can be useful and interesting to go over the details of their strengths and weaknesses.
The biggest thing to know is that Google improves its models incrementally and silently over time, so if you have not used Gemini in months, you might be underestimating what it can do.
I’m hitting send and then jumping on a plane to Berkeley. Perhaps I will see you there over the weekend. That means that if there are mistakes here, I will be slower to respond and correct them than usual, so consider checking the comments section.
Practical Questions First
The [...]
---
Outline:
(00:56) Practical Questions First
(03:51) Speed Kills
(04:44) Very Large Context Windows
(05:14) Relative Performance within the Gemini Family
(07:04) Gemini Flash and the Future Flash-8B
(08:21) New and Improved Evaluations
(14:57) Core Capability Evaluations
(18:14) Model Architecture and Training
(20:08) Safety, Security and Responsibility
(24:45) What Do We Want?
(26:02) Don’t You Know That You’re Toxic?
(28:32) Trying to be Helpful
(29:45) Security Issues
(31:33) Representational Harms
(33:17) Arms-Length Internal Assurance Evaluations
(35:01) External Evaluations
(35:46) Safety Overall
---
First published:
Source:
---
Narrated by TYPE III AUDIO.

26,393 Listeners

2,464 Listeners

1,100 Listeners

109 Listeners

296 Listeners

89 Listeners

551 Listeners

5,553 Listeners

140 Listeners

14 Listeners

140 Listeners

155 Listeners

458 Listeners

0 Listeners

143 Listeners