No Priors: Artificial Intelligence | Technology | Startups

The evolution and promise of RAG architecture with Tengyu Ma from Voyage AI


After years at Stanford researching AI optimization, embedding models, and transformers, Tengyu Ma took a break from academia to start Voyage AI, which helps enterprise customers get the most accurate retrieval possible from the most useful foundational data. Tengyu joins Sarah on this week’s episode of No Priors to discuss why RAG systems are winning as the dominant architecture in the enterprise and how the evolution of foundational data has allowed RAG to flourish. And while fine-tuning is still in the conversation, Tengyu argues that RAG will continue to evolve as the cheapest, quickest, and most accurate system for data retrieval.


They also discuss methods for growing context windows and managing latency budgets, how Tengyu’s research has informed his work at Voyage, and the role academia should play as AI grows as an industry. 


Show Links:

  • Voyage AI
  • Stanford Assistant Professor of Computer Science
  • Tengyu Ma Key Research Papers:
      • Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training
      • Non-convex optimization for machine learning: design, analysis, and understanding
      • Provable Guarantees for Self-Supervised Deep Learning with Spectral Contrastive Loss
      • Larger language models do in-context learning differently, 2023
      • Why Do Pretrained Language Models Help in Downstream Tasks? An Analysis of Head and Prompt Tuning
      • On the Optimization Landscape of Tensor Decompositions

  • Sign up for new podcasts every week. Email feedback to [email protected]

    Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @tengyuma


    Show Notes: 

    (0:00) Introduction

    (1:59) Key points of Tengyu’s research

    (4:28) Academia compared to industry

    (6:46) Voyage AI overview

    (9:44) Enterprise RAG use cases

    (15:23) LLM long-term memory and token limitations

    (18:03) Agent chaining and data management

    (22:01) Improving enterprise RAG 

    (25:44) Latency budgets

    (27:48) Advice for building RAG systems

    (31:06) Learnings as an AI founder

    (32:55) The role of academia in AI


    No Priors: Artificial Intelligence | Technology | Startups, by Conviction
