AI Engineering Podcast

Build More Reliable Machine Learning Systems With The Dagster Orchestration Engine



Summary
Building a machine learning model once can be done in an ad-hoc manner, but if you ever want to update it and serve it in production, you need a way to repeat a complex sequence of operations. Dagster is an orchestration engine that understands the data it is manipulating, so you can move beyond coarse task-based representations of your dependencies. In this episode, Sandy Ryza explains how his background in machine learning has informed his work on the Dagster project, and describes the foundational principles it is built on to allow for collaboration across data engineering and machine learning concerns.
Interview
  • Introduction
  • How did you get involved in machine learning?
  • Can you start by sharing a definition of "orchestration" in the context of machine learning projects?
  • What is your assessment of the state of the orchestration ecosystem as it pertains to ML?
  • Modeling cycles and managing experiment iterations in the execution graph
  • How to balance flexibility with repeatability
  • What are the most interesting, innovative, or unexpected ways that you have seen orchestration implemented/applied for machine learning?
  • What are the most interesting, unexpected, or challenging lessons that you have learned while working on orchestration of ML workflows?
  • When is Dagster the wrong choice?
  • What do you have planned for the future of ML support in Dagster?
Contact Info
  • LinkedIn
  • @s_ryz on Twitter
  • sryza on GitHub
Parting Question
  • From your perspective, what is the biggest barrier to adoption of machine learning today?
Closing Announcements
  • Thank you for listening! Don't forget to check out our other shows. The Data Engineering Podcast covers the latest on modern data management. Podcast.__init__ covers the Python language, its community, and the innovative ways it is being used.
  • Visit the site to subscribe to the show, sign up for the mailing list, and read the show notes.
  • If you've learned something or tried out a project from the show, then tell us about it! Email [email protected] with your story.
  • To help other people find the show, please leave a review on iTunes and tell your friends and co-workers.
Links
  • Dagster
    • Data Engineering Podcast Episode
  • Cloudera
  • Hadoop
  • Apache Spark
  • Peter Norvig
  • Josh Wills
  • REPL == Read Eval Print Loop
  • RStudio
  • Memoization
  • MLflow
  • Kedro
    • Data Engineering Podcast Episode
  • Metaflow
    • Podcast.__init__ Episode
  • Kubeflow
  • dbt
    • Data Engineering Podcast Episode
  • Airbyte
    • Data Engineering Podcast Episode
The intro and outro music is from Hitman's Lovesong feat. Paola Graziano by The Freak Fandango Orchestra/CC BY-SA 3.0
AI Engineering Podcast by Tobias Macey


