No Priors: Artificial Intelligence | Technology | Startups

Personalizing AI Models with Kelvin Guu, Senior Staff Research Scientist, Google Brain


Listen Later

How do you personalize AI models? A popular school of thought in AI is to just dump all the data you need into pre-training or fine tuning. But that may be less efficient and less controllable than alternatives — using AI models as a reasoning engine against external data sources.

Kelvin Guu, Senior Staff Research Scientist at Google, joins Sarah and Elad this week to talk about retrieval, memory, training data attribution and model orchestration. At Google, he led some of the first efforts to leverage pre-trained LMs and neural retrievers, with >30 launches across multiple products. He has done some of the earliest work on retrieval-augmented language models (REALM) and training LLMs to follow instructions (FLAN).

No Priors is now on YouTube! Subscribe to the channel on YouTube and like this episode.

Show Links:

  • Kelvin Guu Website
  • Google Scholar
  • FLAN: Finetuned Language Models Are Zero-Shot Learners
  • Simfluence: Modeling the Influence of Individual Training Examples by Simulating Training Runs
  • ROME: Locating and Editing Factual Associations in GPT
  • Branch-Train-Merge: Scaling Expert Language Models with Unsupervised Domain Discovery
  • Large Language Models Struggle to Learn Long-Tail Knowledge 
  • Sign up for new podcasts every week. Email feedback to [email protected]

    Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @Kelvin_Guu

    Show Notes:

    [1:44] - Kelvin’s background in math, statistics and natural language processing at Stanford

    [3:24] - The questions driving the REALM Paper

    [7:08] - Frameworks around retrieval augmentation & expert models

    [10:16] - Why is modularity important

    [11:36] - FLAN Paper and instruction following

    [13:28] - Updating model weights in real time and other continuous learning methods

    [15:08] - Simfluence Paper & explainability with large language models

    [18:11] - ROME paper, “Model Surgery” exciting research areas

    [19:51] - Personal opinions and thoughts on AI agents & research

    [24:59] - How the human brain compares to AGI regarding memory and emotions

    [28:08] - How models become more contextually available

    [30:45] - Accessibility of models

    [33:47] - Advice to future researchers

    ...more
    View all episodesView all episodes
    Download on the App Store

    No Priors: Artificial Intelligence | Technology | StartupsBy Conviction

    • 4.4
    • 4.4
    • 4.4
    • 4.4
    • 4.4

    4.4

    114 ratings


    More shows like No Priors: Artificial Intelligence | Technology | Startups

    View all
    This Week in Startups by Jason Calacanis

    This Week in Startups

    1,273 Listeners

    a16z Podcast by Andreessen Horowitz

    a16z Podcast

    1,040 Listeners

    The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch by Harry Stebbings

    The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch

    519 Listeners

    The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) by Sam Charrington

    The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

    441 Listeners

    Practical AI by Practical AI LLC

    Practical AI

    192 Listeners

    Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

    Machine Learning Street Talk (MLST)

    88 Listeners

    Dwarkesh Podcast by Dwarkesh Patel

    Dwarkesh Podcast

    428 Listeners

    Unsupervised Learning by by Redpoint Ventures

    Unsupervised Learning

    50 Listeners

    Latent Space: The AI Engineer Podcast by swyx + Alessio

    Latent Space: The AI Engineer Podcast

    75 Listeners

    The Ben & Marc Show by Marc Andreessen, Ben Horowitz

    The Ben & Marc Show

    135 Listeners

    BG2Pod with Brad Gerstner and Bill Gurley by BG2Pod

    BG2Pod with Brad Gerstner and Bill Gurley

    461 Listeners

    AI + a16z by a16z

    AI + a16z

    31 Listeners

    Lightcone Podcast by Y Combinator

    Lightcone Podcast

    22 Listeners

    Training Data by Sequoia Capital

    Training Data

    43 Listeners

    Uncapped with Jack Altman by Alt Capital

    Uncapped with Jack Altman

    35 Listeners