Training Data

Meta’s Joe Spisak on Llama 3.1 405B and the Democratization of Frontier Models


Listen Later

As head of Product Management for Generative AI at Meta, Joe Spisak leads the team behind Llama, which just released the new 3.1 405B model. We spoke with Joe just two days after the model’s release to ask what’s new, what it enables, and how Meta sees the role of open source in the AI ecosystem.


Joe shares that where Llama 3.1 405B really focused is on pushing scale (it was trained on 15 trillion tokens using 16,000 GPUs) and he’s excited about the zero-shot tool use it will enable, as well as its role in distillation and generating synthetic data to teach smaller models. He tells us why he thinks even frontier models will ultimately commoditize—and why that’s a good thing for the startup ecosystem.


Hosted by: Stephanie Zhan and Sonya Huang, Sequoia Capital 


Mentioned in this episode: 

Llama 3.1 405B paper

Open Source AI Is the Way Forward: Mark Zuckerberg essay released with Llama 3.1.

Mistral Large 2

The Bitter Lesson by Rich Sutton


00:00 Introduction

01:28 The Llama 3.1 405B launch

05:02 The open source license

07:01 What's in it for Meta?

10:19 Why not open source?

11:16 Will frontier models commoditize?

12:41 What about startups?

16:29 The Mistral team

19:36 Are all frontier strategies comparable?

22:38 Is model development becoming more like software development?

26:34 Agentic reasoning

29:09 What future levers will unlock reasoning?

31:20 Will coding and math lead to unlocks?

33:09 Small models

34:08 7X more data

37:36 Are we going to hit a wall?

39:49 Lightning round

...more
View all episodesView all episodes
Download on the App Store

Training DataBy Sequoia Capital

  • 4.2
  • 4.2
  • 4.2
  • 4.2
  • 4.2

4.2

36 ratings


More shows like Training Data

View all
This Week in Startups by Jason Calacanis

This Week in Startups

1,283 Listeners

a16z Podcast by Andreessen Horowitz

a16z Podcast

1,080 Listeners

The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch by Harry Stebbings

The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch

527 Listeners

Y Combinator Startup Podcast by Y Combinator

Y Combinator Startup Podcast

221 Listeners

Practical AI by Practical AI LLC

Practical AI

206 Listeners

Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

Machine Learning Street Talk (MLST)

88 Listeners

Grit by Kleiner Perkins

Grit

189 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

457 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

130 Listeners

Latent Space: The AI Engineer Podcast by swyx + Alessio

Latent Space: The AI Engineer Podcast

96 Listeners

Crucible Moments by Sequoia Capital

Crucible Moments

91 Listeners

BG2Pod with Brad Gerstner and Bill Gurley by BG2Pod

BG2Pod with Brad Gerstner and Bill Gurley

482 Listeners

AI + a16z by a16z

AI + a16z

31 Listeners

Lightcone Podcast by Y Combinator

Lightcone Podcast

17 Listeners

Uncapped with Jack Altman by Alt Capital

Uncapped with Jack Altman

41 Listeners

Cheeky Pint by Stripe

Cheeky Pint

16 Listeners