Unsupervised Learning

Ep 71: CEO of TurboPuffer Simon Eskildsen on Building Smarter Retrieval, AI App Must-Have Features & Current State of Vector DBs


Fill out this short listener survey to help us improve the show: https://forms.gle/bbcRiPTRwKoG2tJx8

In this episode, Simon Eskildsen, co-founder and CEO of TurboPuffer, lays out his vision for how AI-native infrastructure needs to evolve now that every application wants to connect massive amounts of context to large language models. He breaks down why traditional databases and even large context windows fall short, especially at scale, and why he sees object-storage-native search as the inevitable next step. Drawing on his experience at Shopify and Readwise, Simon introduces the SCRAP framework (scale, cost, recall, ACLs, and performance) to explain the limits of context stuffing and to make the case for smarter retrieval systems. From practical lessons in building highly reliable infrastructure to hard technical problems in vector indexing, this conversation distills the future of AI infrastructure into first principles.

 

[0:00] Intro

[0:49] The Evolution of AI Context Windows

[2:32] Challenges in AI Data Integration

[3:56] SCRAP: Scale, Cost, Recall, ACLs, and Performance

[9:21] The Rise of Object Storage

[16:47] TurboPuffer Use Cases

[22:32] Challenges in Vector Search

[27:02] Challenges in Query Planning and Data Filtering

[27:53] Focusing on Core Problems and Simplicity

[28:28] Customer Feedback and Future Directions

[29:11] Reliability and Simplicity in Design

[30:39] Evaluating Embedding Models and Search Performance

[32:17] The Role of Vectors in Search Engines

[34:16] Balancing Focus and Expansion

[35:57] AI Infrastructure and Market Trends

[38:36] The Future of Memory in AI

[43:01] Table Stakes for AI in SaaS Applications

[45:55] Multimodal Data and Market Observations

[46:57] Quickfire

 

With your co-hosts: 

@jacobeffron 

- Partner at Redpoint, Former PM Flatiron Health 

@patrickachase 

- Partner at Redpoint, Former ML Engineer LinkedIn 

@ericabrescia 

- Former COO GitHub, Founder Bitnami (acq’d by VMware) 

@jordan_segall 

- Partner at Redpoint

Unsupervised Learning by Redpoint Ventures

4.9

49 ratings

