LessWrong posts by zvi

“The Gemini 1.5 Report” by Zvi



This post goes over the extensive report Google put out on Gemini 1.5.

There are no important surprises. Both Gemini Pro 1.5 and Gemini Flash are ‘highly capable multimodal models incorporating a novel mixture-of-experts architecture’ and various other improvements. They are solid models with solid performance. It can be useful and interesting to go over the details of their strengths and weaknesses.

The biggest thing to know is that Google improves its models incrementally and silently over time, so if you have not used Gemini in months, you might be underestimating what it can do.

I’m hitting send and then jumping on a plane to Berkeley. Perhaps I will see you there over the weekend. That means that if there are mistakes here, I will be slower to respond and correct them than usual, so consider checking the comments section.

Practical Questions First

The [...]

---

Outline:

(00:56) Practical Questions First

(03:51) Speed Kills

(04:44) Very Large Context Windows

(05:14) Relative Performance within the Gemini Family

(07:04) Gemini Flash and the Future Flash-8B

(08:21) New and Improved Evaluations

(14:57) Core Capability Evaluations

(18:14) Model Architecture and Training

(20:08) Safety, Security and Responsibility

(24:45) What Do We Want?

(26:02) Don’t You Know That You’re Toxic?

(28:32) Trying to be Helpful

(29:45) Security Issues

(31:33) Representational Harms

(33:17) Arms-Length Internal Assurance Evaluations

(35:01) External Evaluations

(35:46) Safety Overall

---

First published:

May 31st, 2024

Source:

https://www.lesswrong.com/posts/seM8aQ7Yy6m3i4QPx/the-gemini-1-5-report

---

Narrated by TYPE III AUDIO.
