June 27, 2024

State Space Models and Real-time Intelligence with Karan Goel and Albert Gu from Cartesia

Listen Later

34 minutes

This week on No Priors, Sarah Guo and Elad Gil sit down with Karan Goel and Albert Gu from Cartesia. Karan and Albert first met as Stanford AI Lab PhDs, where their lab invented Space Models or SSMs, a fundamental new primitive for training large-scale foundation models. In 2023, they Founded Cartesia to build real-time intelligence for every device. One year later, Cartesia released Sonic which generates high quality and lifelike speech with a model latency of 135ms—the fastest for a model of this class.

Sign up for new podcasts every week. Email feedback to [email protected]

Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @krandiash | @_albertgu

Show Notes:

(0:00) Introduction

(0:28) Use Cases for Cartesia and Sonic

(1:32) Karan Goel & Albert Gu’s professional backgrounds

(5:06) State Space Models (SSMs) versus Transformer Based Architectures

(11:51) Domain Applications for Hybrid Approaches

(13:10) Text to Speech and Voice

(17:29) Data, Size of Models and Efficiency

(20:34) Recent Launch of Text to Speech Product

(25:01) Multimodality & Building Blocks

(25:54) What’s Next at Cartesia?

(28:28) Latency in Text to Speech

(29:30) Choosing Research Problems Based on Aesthetic

(31:23) Product Demo

(32:48) Cartesia Team & Hiring

...more

View all episodes

View all episodes

Download on the App Store

Download on the App Store

Get it on Google Play

No Priors: Artificial Intelligence | Technology | Startups

By Conviction

4.3

124124 ratings

June 27, 2024

State Space Models and Real-time Intelligence with Karan Goel and Albert Gu from Cartesia

Listen Later

34 minutes

This week on No Priors, Sarah Guo and Elad Gil sit down with Karan Goel and Albert Gu from Cartesia. Karan and Albert first met as Stanford AI Lab PhDs, where their lab invented Space Models or SSMs, a fundamental new primitive for training large-scale foundation models. In 2023, they Founded Cartesia to build real-time intelligence for every device. One year later, Cartesia released Sonic which generates high quality and lifelike speech with a model latency of 135ms—the fastest for a model of this class.

Sign up for new podcasts every week. Email feedback to [email protected]

Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @krandiash | @_albertgu

Show Notes:

(0:00) Introduction

(0:28) Use Cases for Cartesia and Sonic

(1:32) Karan Goel & Albert Gu’s professional backgrounds

(5:06) State Space Models (SSMs) versus Transformer Based Architectures

(11:51) Domain Applications for Hybrid Approaches

(13:10) Text to Speech and Voice

(17:29) Data, Size of Models and Efficiency

(20:34) Recent Launch of Text to Speech Product

(25:01) Multimodality & Building Blocks

(25:54) What’s Next at Cartesia?

(28:28) Latency in Text to Speech

(29:30) Choosing Research Problems Based on Aesthetic

(31:23) Product Demo

(32:48) Cartesia Team & Hiring

...more

More shows like No Priors: Artificial Intelligence | Technology | Startups

This Week in Startups by Jason Calacanis

This Week in Startups

1,290 Listeners

The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch by Harry Stebbings

The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch

537 Listeners

The a16z Show by Andreessen Horowitz

The a16z Show

1,093 Listeners

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) by Sam Charrington

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

436 Listeners

Invest Like the Best with Patrick O'Shaughnessy by Colossus | Investing & Business Podcasts

Invest Like the Best with Patrick O'Shaughnessy

2,354 Listeners

Y Combinator Startup Podcast by Y Combinator

Y Combinator Startup Podcast

228 Listeners

Practical AI by Daniel Whitenack and Chris Benson

Practical AI

208 Listeners

Last Week in AI by Skynet Today

Last Week in AI

314 Listeners

Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

Machine Learning Street Talk (MLST)

99 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

576 Listeners

Latent Space: The AI Engineer Podcast by Latent.Space

Latent Space: The AI Engineer Podcast

101 Listeners

BG2Pod with Brad Gerstner and Bill Gurley by BG2Pod

BG2Pod with Brad Gerstner and Bill Gurley

491 Listeners

AI + a16z by a16z

AI + a16z

34 Listeners

TBPN by John Coogan & Jordi Hays

TBPN

138 Listeners

Uncapped with Jack Altman by Alt Capital

Uncapped with Jack Altman

43 Listeners