No Priors: Artificial Intelligence | Technology | Startups

The evolution and promise of RAG architecture with Tengyu Ma from Voyage AI



After Tengyu Ma spent years at Stanford researching AI optimization, embedding models, and transformers, he took a break from academia to start Voyage AI, which allows enterprise customers to achieve the most accurate retrieval possible through the most useful foundational data. Tengyu joins Sarah on this week’s episode of No Priors to discuss why RAG systems are winning as the dominant architecture in enterprise and the evolution of foundational data that has allowed RAG to flourish. While fine-tuning is still in the conversation, Tengyu argues that RAG will continue to evolve as the cheapest, quickest, and most accurate system for data retrieval.


They also discuss methods for growing context windows and managing latency budgets, how Tengyu’s research has informed his work at Voyage, and the role academia should play as AI grows as an industry. 


Show Links:

  • Voyage AI
  • Stanford Assistant Professor of Computer Science
  • Tengyu Ma Key Research Papers:
  • Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training
  • Non-convex optimization for machine learning: design, analysis, and understanding
  • Provable Guarantees for Self-Supervised Deep Learning with Spectral Contrastive Loss
  • Larger language models do in-context learning differently, 2023
  • Why Do Pretrained Language Models Help in Downstream Tasks? An Analysis of Head and Prompt Tuning
  • On the Optimization Landscape of Tensor Decompositions

  • Sign up for new podcasts every week. Email feedback to [email protected]

    Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @tengyuma


    Show Notes: 

    (0:00) Introduction

    (1:59) Key points of Tengyu’s research

    (4:28) Academia compared to industry

    (6:46) Voyage AI overview

    (9:44) Enterprise RAG use cases

    (15:23) LLM long-term memory and token limitations

    (18:03) Agent chaining and data management

    (22:01) Improving enterprise RAG 

    (25:44) Latency budgets

    (27:48) Advice for building RAG systems

    (31:06) Learnings as an AI founder

    (32:55) The role of academia in AI


    No Priors: Artificial Intelligence | Technology | Startups by Conviction

    4.4

    112 ratings

