Weaviate Podcast

David Garnitz on VectorFlow - Weaviate Podcast #66!


Listen Later

Hey everyone! Thank you so much for watching the 66th Weaviate Podcast with David Garnitz, the creator of VectorFlow! VectorFlow (open-sourced on GH and linked below) is a new tool for ingesting data into Vector Databases such as Weaviate! There is quite an interesting End-to-End stack emerging at the ingestion layer, from retrieving data from misc. sources such as Slack, Salesforce, GitHub, Google Drive, Notion, ... to then Chunking the Text (maybe with the use of Visual Document Layout parsers like what Unstructured is imagining), extracting Metadata potentially (say the "age" of an NBA player as in the Evaporate-Code+ research) -- then sending this data off to embedding model inference and unpacking that can of worms from inference acceleration to load balancing, and finally -- importing the vectors themselves to Weaviate! I learned so much from this conversation, I really hope you enjoy listening and please check out VectorFlow below!

VectorFlow: https://github.com/dgarnitz/vectorflow
Chapters
0:00 VectorFlow on GitHub!
0:52 Welcome David Garnitz!
1:17 Vector Flow, Founding Vision
2:00 Billions of Vectors in Weaviate!
4:20 End-to-end data importing
6:30 Metadata Extraction in Vector Database Flows
10:15 Vectorizing 100s of millions of billions of chunks
15:58 Fine-Tuning Embedding Models
23:50 Zero-Shot Models in Metadata and Chunking
36:36 Vector + SQL
42:45 Self-Driving Databases
49:23 Generative Feedback Loop REST API
51:38 GPT Cache
55:55 Building VectorFlow

...more
View all episodesView all episodes
Download on the App Store

Weaviate PodcastBy Weaviate

  • 4
  • 4
  • 4
  • 4
  • 4

4

4 ratings


More shows like Weaviate Podcast

View all
Fareed Zakaria GPS by CNN

Fareed Zakaria GPS

3,420 Listeners

a16z Podcast by Andreessen Horowitz

a16z Podcast

1,063 Listeners

Acquired by Ben Gilbert and David Rosenthal

Acquired

4,159 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

293 Listeners

Y Combinator Startup Podcast by Y Combinator

Y Combinator Startup Podcast

223 Listeners

DataFramed by DataCamp

DataFramed

268 Listeners

Practical AI by Practical AI LLC

Practical AI

192 Listeners

Last Week in AI by Skynet Today

Last Week in AI

296 Listeners

All-In with Chamath, Jason, Sacks & Friedberg by All-In Podcast, LLC

All-In with Chamath, Jason, Sacks & Friedberg

9,304 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

427 Listeners

No Priors: Artificial Intelligence | Technology | Startups by Conviction

No Priors: Artificial Intelligence | Technology | Startups

129 Listeners

Latent Space: The AI Engineer Podcast by swyx + Alessio

Latent Space: The AI Engineer Podcast

89 Listeners

BG2Pod with Brad Gerstner and Bill Gurley by BG2Pod

BG2Pod with Brad Gerstner and Bill Gurley

461 Listeners

AI + a16z by a16z

AI + a16z

31 Listeners

OpenAI Podcast by OpenAI

OpenAI Podcast

33 Listeners