Super Data Science: ML & AI Podcast with Jon Krohn

SDS 595: Data Engineering 101

07.26.2022 - By Jon Krohn

Tune in as Joe Reis and Matt Housley, co-founders of Ternary Data and co-authors of the book “Fundamentals of Data Engineering,” join Jon Krohn to discuss major undercurrents across the data engineering lifecycle and their top tools and techniques.

In this episode you will learn:

• What is data engineering? [3:55]

• Why Joe and Matt identify as “recovering data scientists” [6:12]

• What kinds of people tend to become data scientists vs. data engineers? [10:38]

• Key components of Joe and Matt’s book [26:31]

• Major undercurrents across the data engineering lifecycle [28:26]

• The most under-utilized tool in a data engineer's toolbox [34:39]

• Why data pipeline latency always involves tradeoffs, even though faster is typically the default assumption [38:55]

• Joe and Matt’s favorite data engineering tools and techniques [43:39]

Additional materials: www.superdatascience.com/595
