Software Huddle

SQL Meets Vector Search with Linpeng Tang of MyScale


Listen Later

Welcome back to an episode where we're talking Vectors, Vector Databases, and AI with Linpeng Tang, CTO and co-founder of MyScale. MyScale is a super interesting technology. They're combining the best of OLAP databases with Vector Search. The project started back in 2019 where they forked ClickHouse and then adapted it to support Vector Storage, Indexing, and Search.

The really unique and cool thing is you get the familiarity and usability of SQL with the power of being able to compare the similarity between unstructured data.

We think this has really fascinating use cases for analytics well beyond what we're seeing with other vector database technology that's mostly restricted to building RAG models for LLMs. Also, because it's built on ClickHouse, MyScale is massively scalable, which is an area that many of the dedicated vector databases actually struggle with.

We cover a lot about how vector databases work, why they decided to build off of ClickHouse, and how they plan to open source the database.


Timestamps

02:29 Introduction

06:22 Value of a Vector Database

12:40 Forking ClickHouse

18:53 Transforming Clickhouse into a SQL vector database

32:08 Data modeling

32:56 What data can be Vectorized

38:37 Indexing

43:35 Achieving Scale

46:35 Bottlenecks

48:41 MyScale vs other dedicated Vector Databases

51:38 Going Open Source

56:04 Closing thoughts

...more
View all episodesView all episodes
Download on the App Store

Software HuddleBy Software Huddle

  • 5
  • 5
  • 5
  • 5
  • 5

5

4 ratings


More shows like Software Huddle

View all
Software Engineering Radio by se-radio@computer.org

Software Engineering Radio

271 Listeners

The Changelog: Software Development, Open Source by Changelog Media

The Changelog: Software Development, Open Source

291 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

624 Listeners

Soft Skills Engineering by Jamison Dance and Dave Smith

Soft Skills Engineering

285 Listeners

Founders by David Senra

Founders

2,084 Listeners

Syntax - Tasty Web Development Treats by Wes Bos & Scott Tolinski - Full Stack JavaScript Web Developers

Syntax - Tasty Web Development Treats

987 Listeners

Practical AI by Practical AI LLC

Practical AI

210 Listeners

My First Million by Hubspot Media

My First Million

2,641 Listeners

All-In with Chamath, Jason, Sacks & Friedberg by All-In Podcast, LLC

All-In with Chamath, Jason, Sacks & Friedberg

9,829 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

489 Listeners

Oxide and Friends by Oxide Computer Company

Oxide and Friends

59 Listeners

Latent Space: The AI Engineer Podcast by swyx + Alessio

Latent Space: The AI Engineer Podcast

97 Listeners

The AI Daily Brief: Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief: Artificial Intelligence News and Analysis

559 Listeners

BG2Pod with Brad Gerstner and Bill Gurley by BG2Pod

BG2Pod with Brad Gerstner and Bill Gurley

509 Listeners

The Pragmatic Engineer by Gergely Orosz

The Pragmatic Engineer

64 Listeners