DataTalks.Club

SE4ML - Software Engineering for Machine Learning - Nadia Nahar


Listen Later

We talked about:

  • Nadia’s background
  • Academic research in software engineering
  • Design patterns
  • Software engineering for ML systems
  • Problems that people in industry have with software engineering and ML
  • Communication issues and setting requirements
  • Artifact research in open source products
  • Product vs model
  • Nadia’s open source product dataset
  • Failure points in machine learning projects
  • Finding solutions to issues using Nadia’s dataset and experience
  • The problem of siloing data scientists and other structure issues
  • The importance of documentation and checklists
  • Responsible AI
  • How data scientists and software engineers can work in an Agile way

  • Links:

    • Model Card: https://arxiv.org/abs/1810.03993
    • Datasheets: https://arxiv.org/abs/1803.09010
    • Factsheets: https://arxiv.org/abs/1808.07261
    • Research Paper: https://www.cs.cmu.edu/~ckaestne/pdf/icse22_seai.pdf
    • Arxiv version: https://arxiv.org/pdf/2110.

    • Free data engineering course: https://github.com/DataTalksClub/data-engineering-zoomcamp

      Join DataTalks.Club: https://datatalks.club/slack.html

      Our events: https://datatalks.club/events.html

      ...more
      View all episodesView all episodes
      Download on the App Store

      DataTalks.ClubBy DataTalks.Club

      • 5
      • 5
      • 5
      • 5
      • 5

      5

      7 ratings


      More shows like DataTalks.Club

      View all
      Software Engineering Radio - the podcast for professional software developers by se-radio@computer.org

      Software Engineering Radio - the podcast for professional software developers

      272 Listeners

      TED Talks Daily by TED

      TED Talks Daily

      11,175 Listeners

      The Changelog: Software Development, Open Source by Changelog Media

      The Changelog: Software Development, Open Source

      285 Listeners

      Freakonomics Radio by Freakonomics Radio + Stitcher

      Freakonomics Radio

      31,914 Listeners

      Super Data Science: ML & AI Podcast with Jon Krohn by Jon Krohn

      Super Data Science: ML & AI Podcast with Jon Krohn

      296 Listeners

      Machine Learning Guide by OCDevel

      Machine Learning Guide

      764 Listeners

      DataFramed by DataCamp

      DataFramed

      268 Listeners

      Learning Bayesian Statistics by Alexandre Andorra

      Learning Bayesian Statistics

      66 Listeners

      The Real Python Podcast by Real Python

      The Real Python Podcast

      138 Listeners

      声动早咖啡 by 声动活泼

      声动早咖啡

      263 Listeners