Training Data

By Sequoia Capital

Join us as we train our neural nets on the theme of the century: AI. Sonya Huang, Pat Grady and more Sequoia Capital partners host conversations with leading AI builders and researchers to ask critica...

4.2

3636 ratings

FAQs about Training Data:

How many episodes does Training Data have?

The podcast currently has 61 episodes available.

Training Data episodes:

August 19, 2025 Scaling the ‘Cursor for Slides’ to $50M ARR: Gamma founder Jon Noronha
Before ChatGPT made AI mainstream, Jon Noronha was building Gamma with a simple insight: everyone hates making slides but needs visual communication for high-stakes ideas. His background at Optimizely proved crucial as Gamma became a testing laboratory for AI models, running hundreds of experiments to discover that Claude excels at creative taste, Gemini wins on cost efficiency, and reasoning models actually hurt creativity. Jon explains how solving their own blank page problem inadvertently solved it for millions of users, turning a near-failing startup into a cash flow positive platform with 250 million presentations created. He discusses competing with PowerPoint's 500 million users while expanding beyond slides into documents, websites and visual storytelling.
Hosted by Sonya Huang, Sequoia Capital
31min
August 12, 2025 Delphi’s Dara Ladjevardian: How AI Digital Minds Can Scale Human Connection
Dara Ladjevardian, founder and CEO of Delphi, is creating digital minds that allow people to scale their thoughts and availability without replacing human connection. Inspired by Ray Kurzweil’s theory of mind as a hierarchy of pattern recognizers, Dara built an adaptive temporal knowledge graph that captures how people think and reason. From helping CEOs train new hires to enabling coaches to monetize their expertise 24/7, Delphi represents a new form of conversational media. Dara explains why authentic human representation matters, how digital minds actually increase desire for real human connection, and why he believes 2026 will be the tipping point for adoption of digital minds.

Hosted by Sonya Huang and Jess Lee, Sequoia Capital

Mentioned in this episode:
How to Create a Mind: 2012 book by Ray Kurzweil that inspired Dara

The Memoirs of Akbar Ladjevardian: 2008 book about Dara’s grandfather, an Iranian industrialist, that led him to create his first “digital mind”

Build: 2022 book by Tony Fadell that refers to itself as “a mentor in a box”; another inspiration for Dara

The 2 Sigma Problem: 1984 paper by Benjamin Bloom about how students that receive one-on-one tutoring perform two standard deviations better than students educated in a classroom environment
40min
August 05, 2025 Vercel CEO Guillermo Rauch: Building the Generative Web with AI
Vercel CEO Guillermo Rauch has spent years obsessing over reducing the friction between having an idea and getting it online. Now with AI, he's achieving something even more ambitious: making software creation accessible to anyone with a keyboard. Guillermo explains how v0 has grown to 3 million users by focusing on reliability and quality, why ChatGPT has become their fastest-growing customer acquisition channel, and how AI is enabling “virtual coworkers” across design, development, and marketing. He shares his contrarian view that the future belongs to ephemeral, generated-on-demand applications rather than traditional installed software, and why he believes we're on the cusp of the biggest transformation to the web in its history.
Hosted by Sonya Huang and Pat Grady, Sequoia Capital

1h 1min
July 30, 2025 OpenAI’s IMO Team on Why Models Are Finally Solving Elite-Level Math
In just two months, a scrappy three-person team at OpenAI sprinted to fulfill what the entire AI field has been chasing for years—gold-level performance on the International Mathematical Olympiad problems. Alex Wei, Sheryl Hsu and Noam Brown discuss their unique approach using general-purpose reinforcement learning techniques on hard-to-verify tasks rather than formal verification tools. The model showed surprising self-awareness by admitting it couldn’t solve problem six, and revealed the humbling gap between solving competition problems and genuine mathematical research breakthroughs.
Hosted by Sonya Huang, Sequoia Capital

31min
July 22, 2025 OpenAI Just Released ChatGPT Agent, Its Most Powerful Agent Yet
Isa Fulford, Casey Chu, and Edward Sun from OpenAI's ChatGPT agent team reveal how they combined Deep Research and Operator into a single, powerful AI agent that can perform complex, multi-step tasks lasting up to an hour. By giving the model access to a virtual computer with text browsing, visual browsing, terminal access, and API integrations—all with shared state—they've created what may be the first truly embodied AI assistant. The team discusses their reinforcement learning approach, safety mitigations for real-world actions, and how small teams can build transformative AI products through close research-applied collaboration.
Hosted by Sonya Huang and Lauren Reeder, Sequoia Capital

38min
July 11, 2025 DeepMind's Pushmeet Kohli on AI's Scientific Revolution
Pushmeet Kohli leads AI for Science at DeepMind, where his team has created AlphaEvolve, an AI system that discovers entirely new algorithms and proves mathematical results that have eluded researchers for decades. From improving 50-year-old matrix multiplication algorithms to generating interpretable code for complex problems like data center scheduling, AlphaEvolve represents a new paradigm where LLMs coupled with evolutionary search can outperform human experts. Pushmeet explains the technical architecture behind these breakthroughs and shares insights from collaborations with mathematicians like Terence Tao, while discussing how AI is accelerating scientific discovery across domains from chip design to materials science.
Hosted by Sonya Huang and Pat Grady, Sequoia Capital

42min
July 08, 2025 Mapping the Mind of a Neural Net: Goodfire’s Eric Ho on the Future of Interpretability
Eric Ho is building Goodfire to solve one of AI’s most critical challenges: understanding what’s actually happening inside neural networks. His team is developing techniques to understand, audit and edit neural networks at the feature level. Eric discusses breakthrough results in resolving superposition through sparse autoencoders, successful model editing demonstrations and real-world applications in genomics with Arc Institute's DNA foundation models. He argues that interpretability will be critical as AI systems become more powerful and take on mission-critical roles in society.
Hosted by Sonya Huang and Roelof Botha, Sequoia Capital
Mentioned in this episode:
Mech interp: Mechanistic interpretability, list of important papers here

Phineas Gage: 19th century railway engineer who lost most of his brain’s left frontal lobe in an accident. Became a famous case study in neuroscience.

Human Genome Project: Effort from 1990-2003 to generate the first sequence of the human genome which accelerated the study of human biology

Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs

Zoom In: An Introduction to Circuits: First important mechanistic interpretability paper from OpenAI in 2020

Superposition: Concept from physics applied to interpretability that allows neural networks to simulate larger networks (e.g. more concepts than neurons)

Apollo Research: AI safety company that designs AI model evaluations and conducts interpretability research

Towards Monosemanticity: Decomposing Language Models With Dictionary Learning. 2023 Anthropic paper that uses a sparse autoencoder to extract interpretable features; followed by Scaling Monosemanticity

Under the Hood of a Reasoning Model: 2025 Goodfire paper that interprets DeepSeek’s reasoning model R1

Auto-interpretability: The ability to use LLMs to automatically write explanations for the behavior of neurons in LLMs

Interpreting Evo 2: Arc Institute's Next-Generation Genomic Foundation Model. (see episode with Arc co-founder Patrick Hsu)

Paint with Ember: Canvas interface from Goodfire that lets you steer an LLM’s visual output in real time (paper here)

Model diffing: Interpreting how a model differs from checkpoint to checkpoint during finetuning

Feature steering: The ability to change the style of LLM output by up or down weighting features (e.g. talking like a pirate vs factual information about the Andromeda Galaxy)

Weight based interpretability: Method for directly decomposing neural network parameters into mechanistic components, instead of using features

The Urgency of Interpretability: Essay by Anthropic founder Dario Amodei
On the Biology of a Large Language Model: Goodfire collaboration with Anthropic
48min
July 01, 2025 ElevenLabs’ Mati Staniszewski: Why Voice Will Be the Fundamental Interface for Tech
Mati Staniszewski, co-founder and CEO of ElevenLabs, explains how staying laser-focused on audio innovation has allowed his company to thrive despite the push into multimodality from foundation models. From a high school friendship in Poland to building one of the fastest-growing AI companies, Mati shares how ElevenLabs transformed text-to-speech with contextual understanding and emotional delivery. He discusses the company's viral moments (from Harry Potter by Balenciaga to powering Darth Vader in Fortnite), and explains how ElevenLabs is creating the infrastructure for voice agents and real-time translation that could eliminate language barriers worldwide.
Hosted by Pat Grady, Sequoia Capital
Mentioned in this episode:
Attention Is All You Need: The original Transformers paper

Tortoise-tts: Open source text to speech model that was a starting point for ElevenLabs (which now maintains a v2)

Harry Potter by Balenciaga: ElevenLabs’ first big viral moment from 2023

The first AI that can laugh: 2022 blog post backing up ElevenLabs’ claim of laughter (it got better in v3)

Darth Vader's voice in Fortnite: ElevenLabs used actual voice clips provided by James Earl Jones before he died

Lex Fridman interviews Prime Minister Modi: ElevenLabs enabled Fridman to speak in Hindi and Modi to speak in English.

Time Person of the Year 2024: ElevenLabs-powered experiment with “conversational journalism”

Iconic Voices: Richard Feynman, Deepak Chopra, Maya Angelou and more available in ElevenLabs reader app

SIP trunking: a method of delivering voice, video, and other unified communications over the internet using the Session Initiation Protocol (SIP)

Genesys: Leading enterprise CX platform for agentic AI

Hitchhiker’s Guide to the Galaxy: Comedy/science-fiction series by Douglas Adams that contains the concept of the Babel Fish instantaneous translator, cited by Mati

FYI: communication and productivity app for creatives that Mati uses, founded by will.i.am

Lovable: prototyping app that Mati loves
1h
June 24, 2025 From DevOps ‘Heart Attacks’ to AI-Powered Diagnostics With Traversal’s AI Agents
Anish Agarwal and Raj Agrawal, co-founders of Traversal, are transforming how enterprises handle critical system failures. Their AI agents can perform root cause analysis in 2-4 minutes instead of the hours typically spent by teams of engineers scrambling in Slack channels. Drawing from their academic research in causal inference and gene regulatory networks, they’ve built agents that systematically traverse complex dependency maps to identify the smoking gun logs and problematic code changes. As AI-generated code becomes more prevalent, Traversal addresses a growing challenge: debugging systems where humans didn’t write the original code, making AI-powered troubleshooting essential for maintaining reliable software at scale.
Hosted by Sonya Huang and Bogomil Balkansky, Sequoia Capital
Mentioned in this episode:
SRE: Site reliability engineering. The function within engineering teams that monitors and improves the availability and performance of software systems and services.

Golden signals: four key metrics used by Site Reliability Engineers (SREs) to monitor the health and performance of IT systems: latency, traffic, errors and saturation.

MELT data: Metrics, events, logs, and traces. A framework for observability.

The Bitter Lesson: Another mention of Turing Award winner Rich Sutton’s influential post.
41min
June 17, 2025 The Breakthroughs Needed for AGI Have Already Been Made: OpenAI Former Research Head Bob McGrew
As OpenAI's former Head of Research, Bob McGrew witnessed the company's evolution from GPT-3’s breakthrough to today's reasoning models. He argues that there are three legs of the stool for AGI—Transformers, scaled pre-training, and reasoning—and that the fundamentals that will shape the next decade-plus are already in place. He thinks 2025 will be defined by reasoning while pre-training hits diminishing returns. Bob discusses why the agent economy will price services at compute costs due to near-infinite supply, fundamentally disrupting industries like law and medicine, and how his children use ChatGPT to spark curiosity and agency. From robotics breakthroughs to managing brilliant researchers, Bob offers a unique perspective on AI’s trajectory and where startups can still find defensible opportunities.
Hosted by Stephanie Zhan and Sonya Huang, Sequoia Capital

Mentioned in this episode:
Solving Rubik’s Cube with a robot hand: OpenAI’s original robotics research

Computer Use and Operator: Anthropic and OpenAI reasoning breakthroughs that originated with OpenAI researchers

Skild and Physical Intelligence: Robotics-oriented companies Bob sees as well-positioned now

Distyl: AI company founded by Palantir alums to create enterprise workflows driven by proprietary data

Member of the technical staff: Title at OpenAI designed to break down barriers between AI researchers and engineers

Howie.ai: Scheduling app that Bob uses
49min

More shows like Training Data

This Week in Startups by Jason Calacanis (1,268 listeners)
a16z Podcast by Andreessen Horowitz (1,063 listeners)
The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch by Harry Stebbings (522 listeners)
Y Combinator Startup Podcast by Y Combinator (223 listeners)
Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST) (92 listeners)
Dwarkesh Podcast by Dwarkesh Patel (427 listeners)
Lenny's Podcast: Product | Career | Growth by Lenny Rachitsky (1,361 listeners)
No Priors: Artificial Intelligence | Technology | Startups by Conviction (129 listeners)
Latent Space: The AI Engineer Podcast by swyx + Alessio (89 listeners)
Crucible Moments by Sequoia Capital (90 listeners)
The Ben & Marc Show by Marc Andreessen, Ben Horowitz (123 listeners)
BG2Pod with Brad Gerstner and Bill Gurley by BG2Pod (461 listeners)
AI + a16z by a16z (31 listeners)
Lightcone Podcast by Y Combinator (18 listeners)
Uncapped with Jack Altman by Alt Capital (33 listeners)
Cheeky Pint by Stripe (13 listeners)