No Priors: Artificial Intelligence | Technology | Startups

The evolution and promise of RAG architecture with Tengyu Ma from Voyage AI


Listen Later

After Tengyu Ma spent years at Stanford researching AI optimization, embedding models, and transformers, he took a break from academia to start Voyage AI which allows enterprise customers to have the most accurate retrieval possible through the most useful foundational data. Tengyu joins Sarah on this week’s episode of No priors to discuss why RAG systems are winning as the dominant architecture in enterprise and the evolution of foundational data that has allowed RAG to flourish. And while fine-tuning is still in the conversation, Tengyu argues that RAG will continue to evolve as the cheapest, quickest, and most accurate system for data retrieval. 


They also discuss methods for growing context windows and managing latency budgets, how Tengyu’s research has informed his work at Voyage, and the role academia should play as AI grows as an industry. 


Show Links:

  • Voyage AI
  • Stanford Assistant Professor of Computer Science
  • Tengyu Ma Key Research Papers:
  • Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training
  • Non-convex optimization for machine learning: design, analysis, and understanding
  • Provable Guarantees for Self-Supervised Deep Learning with Spectral Contrastive Loss
  • Larger language models do in-context learning differently, 2023
  • Why Do Pretrained Language Models Help in Downstream Tasks? An Analysis of Head and Prompt Tuning
  • On the Optimization Landscape of Tensor Decompositions

  • Sign up for new podcasts every week. Email feedback to [email protected]

    Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @tengyuma


    Show Notes: 

    (0:00) Introduction

    (1:59) Key points of Tengyu’s research

    (4:28) Academia compared to industry

    (6:46) Voyage AI overview

    (9:44) Enterprise RAG use cases

    (15:23) LLM long-term memory and token limitations

    (18:03) Agent chaining and data management

    (22:01) Improving enterprise RAG 

    (25:44) Latency budgets

    (27:48) Advice for building RAG systems

    (31:06) Learnings as an AI founder

    (32:55) The role of academia in AI

    ...more
    View all episodesView all episodes
    Download on the App Store

    No Priors: Artificial Intelligence | Technology | StartupsBy Conviction

    • 4.4
    • 4.4
    • 4.4
    • 4.4
    • 4.4

    4.4

    114 ratings


    More shows like No Priors: Artificial Intelligence | Technology | Startups

    View all
    This Week in Startups by Jason Calacanis

    This Week in Startups

    1,273 Listeners

    a16z Podcast by Andreessen Horowitz

    a16z Podcast

    1,040 Listeners

    The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch by Harry Stebbings

    The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch

    519 Listeners

    The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) by Sam Charrington

    The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

    441 Listeners

    Practical AI by Practical AI LLC

    Practical AI

    192 Listeners

    Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

    Machine Learning Street Talk (MLST)

    88 Listeners

    Dwarkesh Podcast by Dwarkesh Patel

    Dwarkesh Podcast

    426 Listeners

    Unsupervised Learning by by Redpoint Ventures

    Unsupervised Learning

    50 Listeners

    Latent Space: The AI Engineer Podcast by swyx + Alessio

    Latent Space: The AI Engineer Podcast

    75 Listeners

    The Ben & Marc Show by Marc Andreessen, Ben Horowitz

    The Ben & Marc Show

    135 Listeners

    BG2Pod with Brad Gerstner and Bill Gurley by BG2Pod

    BG2Pod with Brad Gerstner and Bill Gurley

    461 Listeners

    AI + a16z by a16z

    AI + a16z

    31 Listeners

    Lightcone Podcast by Y Combinator

    Lightcone Podcast

    22 Listeners

    Training Data by Sequoia Capital

    Training Data

    43 Listeners

    Uncapped with Jack Altman by Alt Capital

    Uncapped with Jack Altman

    35 Listeners