DataTalks.Club

SE4ML - Software Engineering for Machine Learning - Nadia Nahar


Listen Later

We talked about:

  • Nadia’s background
  • Academic research in software engineering
  • Design patterns
  • Software engineering for ML systems
  • Problems that people in industry have with software engineering and ML
  • Communication issues and setting requirements
  • Artifact research in open source products
  • Product vs model
  • Nadia’s open source product dataset
  • Failure points in machine learning projects
  • Finding solutions to issues using Nadia’s dataset and experience
  • The problem of siloing data scientists and other structure issues
  • The importance of documentation and checklists
  • Responsible AI
  • How data scientists and software engineers can work in an Agile way

  • Links:

    • Model Card: https://arxiv.org/abs/1810.03993
    • Datasheets: https://arxiv.org/abs/1803.09010
    • Factsheets: https://arxiv.org/abs/1808.07261
    • Research Paper: https://www.cs.cmu.edu/~ckaestne/pdf/icse22_seai.pdf
    • Arxiv version: https://arxiv.org/pdf/2110.

    • Free data engineering course: https://github.com/DataTalksClub/data-engineering-zoomcamp

      Join DataTalks.Club: https://datatalks.club/slack.html

      Our events: https://datatalks.club/events.html

      ...more
      View all episodesView all episodes
      Download on the App Store

      DataTalks.ClubBy DataTalks.Club

      • 5
      • 5
      • 5
      • 5
      • 5

      5

      7 ratings


      More shows like DataTalks.Club

      View all
      Talk Python To Me by Michael Kennedy

      Talk Python To Me

      583 Listeners

      Data Career Podcast: Helping You Land a Data Analyst Job FAST by Avery Smith - Data Career Coach

      Data Career Podcast: Helping You Land a Data Analyst Job FAST

      156 Listeners