March 21, 2024

Speed will win the AI computing battle with Tuhin Srivastava from Baseten

Listen Later

38 minutes

At a time when users are being asked to wait unthinkable seconds for AI products to generate art and answers, speed is what will win the battle heating up in AI computing. At least according to today’s guest, Tuhin Srivastava, the CEO and co-founder of Baseten which gives customers scalable AI infrastructures starting with interference. In this episode of No Priors, Sarah, Elad, and Tuhin discuss why efficient code solutions are more desirable than no code, the most surprising use cases for Baseten, and why all of their jobs are very defensible from AI.

Show Links:

Baseten

Benchmarking fast Mistral 7B inference

Sign up for new podcasts every week. Email feedback to [email protected]

Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @tuhinone

Show Notes:

(0:00) Introduction

(1:19) Capabilities of efficient code enabled development

(4:11) Difference in training inference workloads

(6:12) AI product acceleration

(8:48) Leading on inference benchmarks at Baseten

(12:08) Optimizations for different types of models

(16:11) Internal vs open source models

(19:01) timeline for enterprise scale

(21:53) Rethinking investment in compute spend

(27:50) Defensibility in AI industries

(31:30) Hardware and the chip shortage

(35:47) Speed is the way to win in this industry

(38:26) Wrap

...more

View all episodes

View all episodes

Download on the App Store

Download on the App Store

Get it on Google Play

No Priors: Artificial Intelligence | Technology | Startups

By Conviction

4.3

124124 ratings

March 21, 2024

Speed will win the AI computing battle with Tuhin Srivastava from Baseten

Listen Later

38 minutes

At a time when users are being asked to wait unthinkable seconds for AI products to generate art and answers, speed is what will win the battle heating up in AI computing. At least according to today’s guest, Tuhin Srivastava, the CEO and co-founder of Baseten which gives customers scalable AI infrastructures starting with interference. In this episode of No Priors, Sarah, Elad, and Tuhin discuss why efficient code solutions are more desirable than no code, the most surprising use cases for Baseten, and why all of their jobs are very defensible from AI.

Show Links:

Baseten

Benchmarking fast Mistral 7B inference

Sign up for new podcasts every week. Email feedback to [email protected]

Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @tuhinone

Show Notes:

(0:00) Introduction

(1:19) Capabilities of efficient code enabled development

(4:11) Difference in training inference workloads

(6:12) AI product acceleration

(8:48) Leading on inference benchmarks at Baseten

(12:08) Optimizations for different types of models

(16:11) Internal vs open source models

(19:01) timeline for enterprise scale

(21:53) Rethinking investment in compute spend

(27:50) Defensibility in AI industries

(31:30) Hardware and the chip shortage

(35:47) Speed is the way to win in this industry

(38:26) Wrap

...more

More shows like No Priors: Artificial Intelligence | Technology | Startups

This Week in Startups by Jason Calacanis

This Week in Startups

1,286 Listeners

The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch by Harry Stebbings

The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch

534 Listeners

The a16z Show by Andreessen Horowitz

The a16z Show

1,095 Listeners

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) by Sam Charrington

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

438 Listeners

Invest Like the Best with Patrick O'Shaughnessy by Colossus | Investing & Business Podcasts

Invest Like the Best with Patrick O'Shaughnessy

2,346 Listeners

Y Combinator Startup Podcast by Y Combinator

Y Combinator Startup Podcast

225 Listeners

Practical AI by Practical AI LLC

Practical AI

198 Listeners

Last Week in AI by Skynet Today

Last Week in AI

311 Listeners

Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

Machine Learning Street Talk (MLST)

95 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

531 Listeners

Latent Space: The AI Engineer Podcast by swyx + Alessio

Latent Space: The AI Engineer Podcast

98 Listeners

BG2Pod with Brad Gerstner and Bill Gurley by BG2Pod

BG2Pod with Brad Gerstner and Bill Gurley

473 Listeners

AI + a16z by a16z

AI + a16z

34 Listeners

TBPN by John Coogan & Jordi Hays

TBPN

122 Listeners

Uncapped with Jack Altman by Alt Capital

Uncapped with Jack Altman

41 Listeners