O'Reilly Data Show Podcast

Enabling end-to-end machine learning pipelines in real-world applications


Listen Later

In this episode of the Data Show, I spoke with Nick Pentreath, principal engineer at IBM. Pentreath was an early and avid user of Apache Spark, and he subsequently became a Spark committer and PMC member. Most recently his focus has been on machine learning, particularly deep learning, and he is part of a group within IBM focused on building open source tools that enable end-to-end machine learning pipelines.

We had a great conversation spanning many topics, including:

  • AI Fairness 360 (AIF360), a set of fairness metrics for data sets and machine learning models
  • Adversarial Robustness Toolbox (ART), a Python library for adversarial attacks and defenses.
  • Model Asset eXchange (MAX), a curated and standardized collection of free and open source deep learning models.
  • Tools for model development, governance, and operations, including MLflow, Seldon Core, and Fabric for deep learning
  • Reinforcement learning in the enterprise, and the emergence of relevant open source tools like Ray.
  • Related resources:

    • “Modern Deep Learning: Tools and Techniques”—a new tutorial at the Artificial Intelligence conference in San Jose
    • Harish Doddi on “Simplifying machine learning lifecycle management”
    • Sharad Goel and Sam Corbett-Davies on “Why it’s hard to design fair machine learning models”
    • “Managing risk in machine learning”: considerations for a world where ML models are becoming mission critical
    • “The evolution and expanding utility of Ray”
    • “Local Interpretable Model-Agnostic Explanations (LIME): An Introduction”
    • Forough Poursabzi Sangdeh on why “It’s time for data scientists to collaborate with researchers in other disciplines”
    • ...more
      View all episodesView all episodes
      Download on the App Store

      O'Reilly Data Show PodcastBy O'Reilly Media

      • 4
      • 4
      • 4
      • 4
      • 4

      4

      63 ratings


      More shows like O'Reilly Data Show Podcast

      View all
      The Changelog: Software Development, Open Source by Changelog Media

      The Changelog: Software Development, Open Source

      285 Listeners

      O'Reilly Radar Podcast - O'Reilly Media Podcast by O'Reilly Media

      O'Reilly Radar Podcast - O'Reilly Media Podcast

      35 Listeners

      Data Skeptic by Kyle Polich

      Data Skeptic

      473 Listeners

      Talk Python To Me by Michael Kennedy

      Talk Python To Me

      584 Listeners

      Software Engineering Daily by Software Engineering Daily

      Software Engineering Daily

      631 Listeners

      O'Reilly Design Podcast - O'Reilly Media Podcast by O'Reilly Media

      O'Reilly Design Podcast - O'Reilly Media Podcast

      8 Listeners

      AWS Podcast by Amazon Web Services

      AWS Podcast

      200 Listeners

      Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

      Super Data Science: ML & AI Podcast with Jon Krohn

      293 Listeners

      Python Bytes by Michael Kennedy and Brian Okken

      Python Bytes

      212 Listeners

      NVIDIA AI Podcast by NVIDIA

      NVIDIA AI Podcast

      323 Listeners

      Machine Learning Guide by OCDevel

      Machine Learning Guide

      755 Listeners

      DataFramed by DataCamp

      DataFramed

      271 Listeners

      Practical AI by Practical AI LLC

      Practical AI

      194 Listeners

      Last Week in AI by Skynet Today

      Last Week in AI

      280 Listeners

      安住紳一郎の日曜天国 by TBS RADIO

      安住紳一郎の日曜天国

      168 Listeners

      This Day in AI Podcast by Michael Sharkey, Chris Sharkey

      This Day in AI Podcast

      191 Listeners

      The AI Fundamentalists by Dr. Andrew Clark & Sid Mangalik

      The AI Fundamentalists

      9 Listeners