AI Engineering Podcast

Building Semantic Memory for AI With Cognee



Summary
In this episode of the AI Engineering Podcast, Vasilije Markovic talks about enhancing Large Language Models (LLMs) with memory to improve their accuracy. He discusses the concept of memory in LLMs, which involves managing context windows to enhance reasoning without the high costs of traditional training methods. He explains how context window limitations cause LLMs to forget, and introduces the idea of hierarchical memory, where immediate retrieval and long-term information storage are balanced to improve application performance. Vasilije also shares his work on Cognee, a tool he's developing to manage semantic memory in AI systems, and discusses its potential applications beyond its core use case. He emphasizes the importance of combining cognitive science principles with data engineering to push the boundaries of AI capabilities. He closes with his vision for the future of AI systems, highlighting the role of personalization and the ongoing development of Cognee to support evolving AI architectures.


Announcements
  • Hello and welcome to the AI Engineering Podcast, your guide to the fast-moving world of building scalable and maintainable AI systems
  • Your host is Tobias Macey and today I'm interviewing Vasilije Markovic about adding memory to LLMs to improve their accuracy
Interview
  • Introduction
  • How did you get involved in machine learning?
  • Can you describe what "memory" is in the context of LLM systems?
  • What are the symptoms of "forgetting" that manifest when interacting with LLMs?
    • How do these issues manifest between single-turn vs. multi-turn interactions?
  • How does the lack of hierarchical and evolving memory limit the capabilities of LLM systems?
  • What are the technical/architectural requirements to add memory to an LLM system/application?
  • How does Cognee help to address the shortcomings of current LLM/RAG architectures?
  • Can you describe how Cognee is implemented?
    • Recognizing that it has only existed for a short time, how have the design and scope of Cognee evolved since you first started working on it?
  • What are the data structures that are most useful for managing the memory structures?
  • For someone who wants to incorporate Cognee into their LLM architecture, what is involved in integrating it into their applications?
    • How does it change the way that you think about the overall requirements for an LLM application?
  • For systems that interact with multiple LLMs, how does Cognee manage context across those systems? (e.g. different agents for different use cases)
  • Other systems are being built to manage user personalization in LLM applications. How do the goals of Cognee relate to those use cases? (e.g. Mem0 - https://github.com/mem0ai/mem0)
  • What are the unknowns that you are still navigating with Cognee?
  • What are the most interesting, innovative, or unexpected ways that you have seen Cognee used?
  • What are the most interesting, unexpected, or challenging lessons that you have learned while working on Cognee?
  • When is Cognee the wrong choice?
  • What do you have planned for the future of Cognee?
Contact Info
  • LinkedIn
Parting Question
  • From your perspective, what are the biggest gaps in tooling, technology, or training for AI systems today?
Closing Announcements
  • Thank you for listening! Don't forget to check out our other shows. The Data Engineering Podcast covers the latest on modern data management. Podcast.__init__ covers the Python language, its community, and the innovative ways it is being used.
  • Visit the site to subscribe to the show, sign up for the mailing list, and read the show notes.
  • If you've learned something or tried out a project from the show then tell us about it! Email [email protected] with your story.
  • To help other people find the show please leave a review on iTunes and tell your friends and co-workers.
Links
  • Cognee
  • Montenegro
  • Catastrophic Forgetting
  • Multi-Turn Interaction
  • RAG == Retrieval Augmented Generation
    • Podcast Episode
  • GraphRAG
    • Podcast Episode
  • Long-term memory
  • Short-term memory
  • LangChain
  • LlamaIndex
  • Haystack
  • dlt
    • Data Engineering Podcast Episode
  • Pinecone
    • Podcast Episode
  • Agentic RAG
  • Airflow
  • DAG == Directed Acyclic Graph
  • FalkorDB
  • Neo4j
  • Pydantic
  • AWS ECS
  • AWS SNS
  • AWS SQS
  • AWS Lambda
  • LLM As Judge
  • Mem0
  • Qdrant
  • LanceDB
  • DuckDB
The intro and outro music is from Hitman's Lovesong feat. Paola Graziano by The Freak Fandango Orchestra/CC BY-SA 3.0
AI Engineering Podcast, by Tobias Macey
