By Dmitry Kan
Rated 5.0 from 22 ratings
The podcast currently has 26 episodes available.
Video: https://youtu.be/dVIPBxHJ1kQ
00:00 Intro
00:15 Greets for Sonam
01:02 Importance of metric learning
03:37 Sonam's background: Rasa, Qdrant
04:31 What's EmbedAnything
5:52 What a user gets
8:48 Do I need to know Rust?
10:18 Call-out to the community
10:35 Multimodality
12:32 How to evaluate quality of LLM-based systems
16:38 QA for multimodal use cases
18:17 Place for a human in the LLM craze
19:00 Use cases for EmbedAnything
20:54 Closing theme (a longer one - enjoy!)
Show notes:
- GitHub: https://github.com/StarlightSearch/EmbedAnything
- HuggingFace Candle: https://github.com/huggingface/candle
- Sonam's talk at Berlin Buzzwords 2024: https://www.youtube.com/watch?v=YfR3kuSo-XQ
- Removing the GIL from Python (PEP 703): https://peps.python.org/pep-0703
- CLIP-blind pairs: https://arxiv.org/abs/2401.06209
- Dark matter of intelligence: https://ai.meta.com/blog/self-supervised-learning-the-dark-matter-of-intelligence/
- Rasa chatbots: https://github.com/RasaHQ/rasa
- Prometheus: https://github.com/prometheus-eval/prometheus-eval
- DINO: https://github.com/facebookresearch/dino
00:00 Intro
00:30 Greets for Doug
01:46 Apache Solr and stuff
03:08 Hello LTR project
04:42 Secret sauce of Doug's continuous blogging
08:50 SearchArray
13:22 Running complex ML experiments
17:29 Efficient search orgs
22:58 Writing a book on search and AI
Show notes:
- Doug's talk on Learning To Rank at Reddit delivered at the Berlin Buzzwords 2024 conference: https://www.youtube.com/watch?v=gUtF1gyHsSM
- Hello LTR: https://github.com/o19s/hello-ltr
- Lexical search for pandas with SearchArray: https://github.com/softwaredoug/searcharray
- https://softwaredoug.com/
- What AI Engineers Should Know about Search: https://softwaredoug.com/blog/2024/06/25/what-ai-engineers-need-to-know-search
- AI Powered Search: https://www.manning.com/books/ai-powered-search
- Quepid: https://github.com/o19s/quepid
- Branching out in your ML / search experiments: https://dvc.org/doc/use-cases
- Doug on Twitter: https://x.com/softwaredoug
- Doug on LinkedIn: https://www.linkedin.com/in/softwaredoug/
00:00 Intro
00:21 Guest Introduction: Eric Pugh
03:00 Eric's story in search and the evolution of search technology
07:27 Quepid: Improving Search Relevancy
10:08 When to use Quepid
14:53 Flashback to Apache Solr 1.4 and the book (of which Eric is a co-author)
17:49 Quepid Demo and Future Enhancements
23:57 Real-Time Query Doc Pairs with WebSockets
24:16 Integrating Quepid with Search Engines
25:57 Introducing LLM-Based Judgments
28:05 Scaling Up Judgments with AI
28:48 Data Science Notebooks in Quepid
33:23 Custom Scoring in Quepid
39:23 API and Developer Tools
42:17 The Future of Search and Personal Reflections
Show notes:
- Hosted Quepid: https://app.quepid.com/
- Ragas, an evaluation framework for Retrieval Augmented Generation (RAG) pipelines: https://github.com/explodinggradients...
- Why Quepid: https://quepid.com/why-quepid/
- Quepid on Github: https://github.com/o19s/quepid
00:00 Intro
01:54 Reflection on the past year in AI
08:08 Reader LLM (and RAG)
12:36 Does it need fine-tuning to a domain?
14:20 How LLMs can lie
17:32 What if data isn't perfect
21:21 SWIRL's secret sauce with Reader LLM
23:55 Feedback loop
26:14 Some surprising client perspective
31:17 How Gen AI can change communication interfaces
34:11 Call-out to the Community
00:00 Intro
00:42 Louis's background
05:39 From Facebook to Rockset
07:41 Embeddings prior to deep learning / LLM era
12:35 What's Rockset as a product
15:27 Use cases
18:04 RocksDB as part of Rockset
20:33 AI capabilities: ANN index, hybrid search
25:11 Types of hybrid search
28:05 Can one learn the alpha?
30:03 Louis's prediction of the future of vector search
33:55 RAG and other AI capabilities
41:46 Call out to the Vector Search community
46:16 Vector Databases vs Databases
49:16 Question of WHY
Topics:
00:00 Intro - how do you like our new design?
00:52 Greets
01:55 Saurabh's background
03:04 Resume Matcher: 4.5K stars, 800 community members, 1.5K forks
04:11 How did you grow the project?
05:42 Target audience and how to use Resume Matcher
09:00 How did you attract so many contributors?
12:47 Architecture aspects
15:10 Cloud or not
16:12 Challenges in maintaining open-source projects
17:56 Developer marketing with Swirl AI Connect
21:13 What you (listener) can help with
22:52 What drives you?
Show notes:
- Resume Matcher: https://github.com/srbhr/Resume-Matcher (website: https://resumematcher.fyi/)
- Ultimate CV by Martin John Yate: https://www.amazon.com/Ultimate-CV-Cr...
- fastembed: https://github.com/qdrant/fastembed
- Swirl: https://github.com/swirlai/swirl-search
Topics:
00:00 Intro
00:22 Quick demo of SWIRL on the summary transcript of this episode
01:29 Sid’s background
08:50 Enterprise vs Federated search
17:48 How vector search covers for missing folksonomy in enterprise data
26:07 Relevancy from vector search standpoint
31:58 How ChatGPT improves programmer’s productivity
32:57 Demo!
45:23 Google PSE
53:10 Ideal user of SWIRL
57:22 Where SWIRL sits architecturally
1:01:46 How to evolve SWIRL with domain expertise
1:04:59 Reasons to go open source
1:10:54 How SWIRL and Sid interact with ChatGPT
1:23:22 The magical question of WHY
1:27:58 Sid’s announcements to the community
YouTube version: https://www.youtube.com/watch?v=vhQ5LM5pK_Y
Design by Saurabh Rai: https://twitter.com/_srbhr_ Check out his Resume Matcher project: https://www.resumematcher.fyi/
Topics:
00:00 Intro
02:20 Atita’s path into search engineering
09:00 When it’s time to contribute to open source
12:08 Taking management role vs software development
14:36 Knowing what you like (and coming up with a Solr course)
19:16 Read the source code (and cook)
23:32 Open Bistro Innovations Lab and moving to Germany
26:04 Affinity to Search world and working as a Search Relevance Consultant
28:39 Bringing vector search to Chorus and Querqy
34:09 What Atita learnt from Eric Pugh’s approach to improving Quepid
36:53 Making vector search with Solr & Elasticsearch accessible through tooling and documentation
41:09 Demystifying data embedding for clients (and for Java based search engines)
43:10 Shifting away from generic to domain-specific in search+vector saga
46:06 Hybrid search: where it will be useful to combine keyword with semantic search
50:53 Choosing between new vector DBs and “old” keyword engines
58:35 Women of Search
1:14:03 Important (and friendly) People of Open Source
1:22:38 Reinforcement learning applied to our careers
1:26:57 The magical question of WHY
1:29:26 Announcements
See show notes on YouTube: https://www.youtube.com/watch?v=BVM6TUSfn3E
Topics:
00:00 Intro
01:54 Things Connor learnt in the past year that changed his perception of Vector Search
02:42 Is search becoming conversational?
05:46 Connor asks Dmitry: How Large Language Models will change Search?
08:39 Vector Search Pyramid
09:53 Large models, data, Form vs Meaning and octopus underneath the ocean
13:25 Examples of getting help from ChatGPT and how it compares to web search today
18:32 Classical search engines with URLs for verification vs ChatGPT-style answers
20:15 Hybrid search: keywords + semantic retrieval
23:12 Connor asks Dmitry about his experience with sparse retrieval
28:08 SPLADE vectors
34:10 OOD-DiskANN: handling the out-of-distribution queries, and nuances of sparse vs dense indexing and search
39:54 Ways to debug a query case in dense retrieval (spoiler: it is a challenge!)
44:47 Intricacies of teaching ML models to understand your data and re-vectorization
49:23 Local IDF vs global IDF and how dense search can approach this issue
54:00 Realtime index
59:01 Natural language to SQL
1:04:47 Turning text into a causal DAG
1:10:41 Engineering and Research as two highly intelligent disciplines
1:18:34 Podcast search
1:25:24 Ref2Vec for recommender systems
1:29:48 Announcements
For Show Notes, please check out the YouTube episode below.
This episode on YouTube: https://www.youtube.com/watch?v=2Q-7taLZ374
Podcast design: Saurabh Rai: https://twitter.com/srvbhr
Toloka’s support for Academia: grants and educator partnerships
- Educator collaboration form: https://toloka.ai/collaboration-with-educators-form
- Research grants form: https://toloka.ai/research-grants-form
Landing pages for these programs:
- https://toloka.ai/academy/education-partnerships
- https://toloka.ai/grants
Topics:
00:00 Intro
01:25 Jenny’s path from graduating in ML to a Data Advocate role
07:50 What goes into the labeling process with Toloka
11:27 How to prepare data for labeling and design tasks
16:01 Jenny’s take on why Relevancy needs more data in addition to clicks in Search
18:23 Dmitry plays the Devil’s Advocate for a moment
22:41 Implicit signals vs user behavior and offline A/B testing
26:54 Dmitry goes back to advocating for good search practices
27:42 Flower search as a concrete example of labeling for relevancy
39:12 NDCG, ERR as ranking quality metrics
44:27 Cross-annotator agreement, perfect list for NDCG and Aggregations
47:17 On measuring and ensuring the quality of annotators with honeypots
54:48 Deep-dive into aggregations
59:55 Bias in data, SERP, labeling and A/B tests
1:16:10 Is unbiased data attainable?
1:23:20 Announcements
This episode on YouTube: https://youtu.be/Xsw9vPFqGf4
Podcast design: Saurabh Rai: https://twitter.com/srvbhr