Training Data

Fireworks Founder Lin Qiao on How Fast Inference and Small Models Will Benefit Businesses


Listen Later

In the first wave of the generative AI revolution, startups and enterprises built on top of the best closed-source models available, mostly from OpenAI. The AI customer journey moves from training to inference, and as these first products find PMF, many are hitting a wall on latency and cost.


Fireworks Founder and CEO Lin Qiao led the PyTorch team at Meta that rebuilt the whole stack to meet the complex needs of the world’s largest B2C company. Meta moved PyTorch to its own non-profit foundation in 2022 and Lin started Fireworks with the mission to compress the timeframe of training and inference and democratize access to GenAI beyond the hyperscalers to let a diversity of AI applications thrive.


Lin predicts when open and closed source models will converge and reveals her goal to build simple API access to the totality of knowledge.


Hosted by: Sonya Huang and Pat Grady, Sequoia Capital 


Mentioned in this episode:

  • Pytorch: the leading framework for building deep learning models, originated at Meta and now part of the Linux Foundation umbrella
  • Caffe2 and ONNX: ML frameworks Meta used that PyTorch eventually replaced
  • Conservation of complexity: the idea that that every computer application has inherent complexity that cannot be reduced but merely moved between the backend and frontend, originated by Xerox PARC researcher Larry Tesler 
  • Mixture of Experts: a class of transformer models that route requests between different subsets of a model based on use case
  • Fathom: a product the Fireworks team uses for video conference summarization 
  • LMSYS Chatbot Arena: crowdsourced open platform for LLM evals hosted on Hugging Face


     00:00 - Introduction

    02:01 - What is Fireworks?

    02:48 - Leading Pytorch

    05:01 - What do researchers like about PyTorch?

    07:50 - How Fireworks compares to open source

    10:38 - Simplicity scales

    12:51 - From training to inference

    17:46 - Will open and closed source converge?

    22:18 - Can you match OpenAI on the Fireworks stack?

    26:53 - What is your vision for the Fireworks platform?

    31:17 - Competition for Nvidia?

    32:47 - Are returns to scale starting to slow down?

    34:28 - Competition

    36:32 - Lightning round

    ...more
    View all episodesView all episodes
    Download on the App Store

    Training DataBy Sequoia Capital

    • 4.2
    • 4.2
    • 4.2
    • 4.2
    • 4.2

    4.2

    36 ratings


    More shows like Training Data

    View all
    This Week in Startups by Jason Calacanis

    This Week in Startups

    1,273 Listeners

    a16z Podcast by Andreessen Horowitz

    a16z Podcast

    1,033 Listeners

    The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch by Harry Stebbings

    The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch

    519 Listeners

    Invest Like the Best with Patrick O'Shaughnessy by Colossus | Investing & Business Podcasts

    Invest Like the Best with Patrick O'Shaughnessy

    2,316 Listeners

    Y Combinator Startup Podcast by Y Combinator

    Y Combinator Startup Podcast

    217 Listeners

    Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

    Machine Learning Street Talk (MLST)

    88 Listeners

    Dwarkesh Podcast by Dwarkesh Patel

    Dwarkesh Podcast

    408 Listeners

    No Priors: Artificial Intelligence | Technology | Startups by Conviction

    No Priors: Artificial Intelligence | Technology | Startups

    121 Listeners

    Unsupervised Learning by by Redpoint Ventures

    Unsupervised Learning

    39 Listeners

    Latent Space: The AI Engineer Podcast by swyx + Alessio

    Latent Space: The AI Engineer Podcast

    75 Listeners

    Crucible Moments by Sequoia Capital

    Crucible Moments

    92 Listeners

    The Ben & Marc Show by Marc Andreessen, Ben Horowitz

    The Ben & Marc Show

    135 Listeners

    BG2Pod with Brad Gerstner and Bill Gurley by BG2Pod

    BG2Pod with Brad Gerstner and Bill Gurley

    461 Listeners

    AI + a16z by a16z

    AI + a16z

    31 Listeners

    Lightcone Podcast by Y Combinator

    Lightcone Podcast

    22 Listeners

    Uncapped with Jack Altman by Alt Capital

    Uncapped with Jack Altman

    17 Listeners