Complex Systems with Patrick McKenzie (patio11)

AI and the great developer speed-up, with Joel Becker of METR



This week on Complex Systems, Patrick McKenzie (patio11) is joined by Joel Becker from METR. They discuss groundbreaking research on AI coding assistants.


Joel and his co-authors ran a randomized controlled trial of 16 expert developers working on major open source projects, and it produced a counterintuitive finding: despite predictions of 24-40% speed improvements, developers actually took 19% longer to complete tasks when using AI tools, even though they retrospectively believed they had been 20% faster. The conversation explores why even sophisticated professionals struggle to accurately assess their own productivity with AI tools, the industrial organization of software development, and the implications for AI's recursive self-improvement in research and development. It also touches on other perspectives from software developers using these tools professionally, and on where the tools can be expected to improve rapidly.

Full transcript available here: www.complexsystemspodcast.com/the-great-developer-speed-up-with-joel-becker/


Sponsor:
This episode is brought to you by Mercury, the fintech trusted by 200K+ companies — from first milestones to running complex systems. Mercury offers banking that truly understands startups and scales with them. Start today at Mercury.com 

Mercury is a financial technology company, not a bank. Banking services provided by Choice Financial Group, Column N.A., and Evolve Bank & Trust; Members FDIC.

Recommended in this episode:

  • METR: https://metr.org/ 
  • Joel Becker’s site: https://joel-becker.com/ 

Timestamps:

(00:00) Intro

(00:34) Understanding AI evaluation methods

(02:04) METR's unique approach to AI evaluation

(03:10) The evolution of AI capabilities

(06:44) AI as coding assistants

(09:15) Research on AI's impact on developer productivity

(13:55) Sponsor: Mercury

(15:07) Challenges in measuring developer productivity

(20:38) Insights from the research paper

(31:26) The formalities of software development

(32:07) Automated tools and human discussions

(32:47) AI and style transfer in software

(34:35) The role of comments in AI coding

(36:51) The future of AI in software engineering

(40:25) Economic implications of AI in software

(46:53) Challenges and risks of AI in software

(59:03) Security concerns with AI-generated code

(01:04:59) Wrap

