April 17, 2025

SWE-bench & SWE-agent | Data Brew | Episode 44

Listen Later

36 minutes

In this episode, Kilian Lieret, Research Software Engineer, and Carlos Jimenez, Computer Science PhD Candidate at Princeton University, discuss SWE-bench and SWE-agent, two groundbreaking tools for evaluating and enhancing AI in software engineering.

Highlights include:
- SWE-bench: A benchmark for assessing AI models on real-world coding tasks.
- Addressing data leakage concerns in GitHub-sourced benchmarks.
- SWE-agent: An AI-driven system for navigating and solving coding challenges.
- Overcoming agent limitations, such as getting stuck in loops.
- The future of AI-powered code reviews and automation in software engineering.

...more

View all episodes

View all episodes

Download on the App Store

Download on the App Store

Get it on Google Play

Data Brew by Databricks

By Databricks

4.8

2020 ratings

April 17, 2025

SWE-bench & SWE-agent | Data Brew | Episode 44

Listen Later

36 minutes

In this episode, Kilian Lieret, Research Software Engineer, and Carlos Jimenez, Computer Science PhD Candidate at Princeton University, discuss SWE-bench and SWE-agent, two groundbreaking tools for evaluating and enhancing AI in software engineering.

Highlights include:
- SWE-bench: A benchmark for assessing AI models on real-world coding tasks.
- Addressing data leakage concerns in GitHub-sourced benchmarks.
- SWE-agent: An AI-driven system for navigating and solving coding challenges.
- Overcoming agent limitations, such as getting stuck in loops.
- The future of AI-powered code reviews and automation in software engineering.

...more

More shows like Data Brew by Databricks

The McKinsey Podcast by McKinsey & Company

The McKinsey Podcast

406 Listeners

Making Sense with Sam Harris by Sam Harris

Making Sense with Sam Harris

26,250 Listeners

Pivot by New York Magazine

Pivot

9,622 Listeners

Data Skeptic by Kyle Polich

Data Skeptic

478 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

626 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

301 Listeners

Y Combinator Startup Podcast by Y Combinator

Y Combinator Startup Podcast

228 Listeners

DataFramed by DataCamp

DataFramed

266 Listeners

The Intelligence from The Economist by The Economist

The Intelligence from The Economist

2,534 Listeners

All-In with Chamath, Jason, Sacks & Friedberg by All-In Podcast, LLC

All-In with Chamath, Jason, Sacks & Friedberg

10,182 Listeners

Barron's Streetwise by Barron's

Barron's Streetwise

1,561 Listeners

Dwarkesh Podcast by Dwarkesh Patel

Dwarkesh Podcast

576 Listeners

Coaching Real Leaders by Muriel Wilkins

Coaching Real Leaders

677 Listeners

On with Kara Swisher by Vox Media

On with Kara Swisher

3,483 Listeners

AI + a16z by a16z

AI + a16z

34 Listeners