
In this episode, I sit down with Michelle Pokrass, who leads a post-training research team at OpenAI focused on improving models for power users: developers building on the API and heavy ChatGPT users. We unpack how OpenAI prioritized instruction-following and long context, why evals have a 3-month shelf life, what separates successful AI startups, and how the best teams are fine-tuning to push past the current frontier.
If you’ve ever wondered how OpenAI really decides what to build, and how it affects what you should build, this one’s for you.
(0:00) Intro
(1:03) Deep Dive into GPT-4.1 Development
(2:23) User Feedback and Model Evaluation
(4:01) Challenges and Improvements in Model Training
(5:54) Advancements in AI Coding Capabilities
(9:11) Future of AI Models and Fine-Tuning
(20:44) Multimodal Capabilities
(22:59) Deep Tech Applications and Data Efficiency
(24:14) Preference Fine-Tuning vs. RFT
(26:29) Choosing the Right Model for Your Needs
(28:18) Prompting Techniques and Model Improvements
(32:10) Future Research and Model Enhancements
(39:14) Power Users and Personalization
(40:22) Personal Journey and Organizational Growth
(43:37) Quickfire
With your co-hosts:
@jacobeffron
- Partner at Redpoint, Former PM Flatiron Health
@patrickachase
- Partner at Redpoint, Former ML Engineer LinkedIn
@ericabrescia
- Former COO GitHub, Founder Bitnami (acq’d by VMware)
@jordan_segall
- Partner at Redpoint