DataFramed

#49 Data Science Tool Building


Listen Later

Hugo speaks with Wes McKinney, creator of the pandas project for data analysis tools in Python and author of Python for Data Analysis, among many other things. Wes and Hugo talk about data science tool building, what it took to get pandas off the ground and how he approaches building “human interfaces to data” to make individuals more productive. On top of this, they’ll talk about the future of data science tooling, including the Apache arrow project and how it can facilitate this future, the importance of DataFrames that are portable between programming languages and building tools that facilitate data analysis work in the big data limit. Pandas initially arose from Wes noticing that people were nowhere near as productive as they could be due to lack of tooling & the projects he’s working on today, which they’ll discuss, arise from the same place and present a bold vision for the future.LINKS FROM THE SHOWDATAFRAMED SURVEY

  • DataFramed Survey (take it so that we can make an even better podcast for you)

DATAFRAMED GUEST SUGGESTIONS

  • DataFramed Guest Suggestions (who do you want to hear on Season 2?)

FROM THE INTERVIEW

  • Wes on Twitter
  • Roads and Bridges: The Unseen Labor Behind Our Digital Infrastructure by Nadia Eghbal
  • pandas, an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language.
  • Ursa Labs

FROM THE SEGMENTS

Data Science Best Practices (with Ben Skrainka ~17:10)

  • To Explain or To Predict? (By Galit Shmueli)
  • Statistical Modeling: The Two Cultures (By Leo Breiman)
  • The Book of Why (By Judea Pearl & Dana Mackenzie)

Studies in Interpretability (with Peadar Coyle at ~39:00)

  • Modelling Loss Curves in Insurance with RStan (By Mick Cooney)
  • Lime: Explaining the predictions of any machine learning classifier 
  • Probabilistic Programming Primer

Original music and sounds by The Sticks.

...more
View all episodesView all episodes
Download on the App Store

DataFramedBy DataCamp

  • 4.9
  • 4.9
  • 4.9
  • 4.9
  • 4.9

4.9

264 ratings


More shows like DataFramed

View all
Data Skeptic by Kyle Polich

Data Skeptic

470 Listeners

Talk Python To Me by Michael Kennedy

Talk Python To Me

586 Listeners

Software Engineering Daily by Software Engineering Daily

Software Engineering Daily

628 Listeners

Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

Super Data Science: ML & AI Podcast with Jon Krohn

296 Listeners

NVIDIA AI Podcast by NVIDIA

NVIDIA AI Podcast

324 Listeners

Data Engineering Podcast by Tobias Macey

Data Engineering Podcast

140 Listeners

Practical AI by Practical AI LLC

Practical AI

190 Listeners

The Stack Overflow Podcast by The Stack Overflow Podcast

The Stack Overflow Podcast

63 Listeners

The Real Python Podcast by Real Python

The Real Python Podcast

136 Listeners

Last Week in AI by Skynet Today

Last Week in AI

282 Listeners

Machine Learning Street Talk (MLST) by Machine Learning Street Talk (MLST)

Machine Learning Street Talk (MLST)

87 Listeners

This Day in AI Podcast by Michael Sharkey, Chris Sharkey

This Day in AI Podcast

189 Listeners

Latent Space: The AI Engineer Podcast by swyx + Alessio

Latent Space: The AI Engineer Podcast

63 Listeners

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis by Nathaniel Whittemore

The AI Daily Brief (Formerly The AI Breakdown): Artificial Intelligence News and Analysis

421 Listeners

Training Data by Sequoia Capital

Training Data

36 Listeners