DataTalks.Club

SE4ML - Software Engineering for Machine Learning - Nadia Nahar


Listen Later

We talked about:

  • Nadia’s background
  • Academic research in software engineering
  • Design patterns
  • Software engineering for ML systems
  • Problems that people in industry have with software engineering and ML
  • Communication issues and setting requirements
  • Artifact research in open source products
  • Product vs model
  • Nadia’s open source product dataset
  • Failure points in machine learning projects
  • Finding solutions to issues using Nadia’s dataset and experience
  • The problem of siloing data scientists and other structure issues
  • The importance of documentation and checklists
  • Responsible AI
  • How data scientists and software engineers can work in an Agile way

  • Links:

    • Model Card: https://arxiv.org/abs/1810.03993
    • Datasheets: https://arxiv.org/abs/1803.09010
    • Factsheets: https://arxiv.org/abs/1808.07261
    • Research Paper: https://www.cs.cmu.edu/~ckaestne/pdf/icse22_seai.pdf
    • Arxiv version: https://arxiv.org/pdf/2110.

    • Free data engineering course: https://github.com/DataTalksClub/data-engineering-zoomcamp

      Join DataTalks.Club: https://datatalks.club/slack.html

      Our events: https://datatalks.club/events.html

      ...more
      View all episodesView all episodes
      Download on the App Store

      DataTalks.ClubBy DataTalks.Club

      • 5
      • 5
      • 5
      • 5
      • 5

      5

      7 ratings


      More shows like DataTalks.Club

      View all
      Radiolab by WNYC Studios

      Radiolab

      44,003 Listeners

      Hidden Brain by Hidden Brain, Shankar Vedantam

      Hidden Brain

      43,647 Listeners

      The Knowledge Project by Shane Parrish

      The Knowledge Project

      2,673 Listeners

      Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

      Super Data Science: ML & AI Podcast with Jon Krohn

      303 Listeners

      Data Engineering Podcast by Tobias Macey

      Data Engineering Podcast

      144 Listeners

      The Real Python Podcast by Real Python

      The Real Python Podcast

      141 Listeners

      Huberman Lab by Scicomm Media

      Huberman Lab

      29,206 Listeners

      The Ezra Klein Show by New York Times Opinion

      The Ezra Klein Show

      16,029 Listeners

      ReThinking by TED

      ReThinking

      624 Listeners

      Data Career Podcast: Helping You Land a Data Analyst Job FAST by Avery Smith - Data Career Coach

      Data Career Podcast: Helping You Land a Data Analyst Job FAST

      160 Listeners

      The Analytics Engineering Podcast by dbt Labs, Inc.

      The Analytics Engineering Podcast

      28 Listeners

      The Tucker Carlson Show by Tucker Carlson Network

      The Tucker Carlson Show

      17,023 Listeners