Software Engineering Daily

Great Expectations: Data Pipeline Testing with Abe Gong

02.17.2020 - By Software Engineering Daily

A data pipeline is a series of steps that takes large data sets and creates usable results from them. At the beginning of a data pipeline, a data set might be pulled from a database, a distributed file system, or a Kafka topic. Throughout a data pipeline, different data sets are joined, filtered, and statistically
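As a rough illustration of the pull, join, and filter steps described above, here is a minimal sketch in plain Python. Everything in it is an assumption made for illustration: the `ORDERS` and `USERS` records, their field names, and the helper functions are hypothetical, and in-memory lists stand in for a real database, distributed file system, or Kafka topic.

```python
# Hypothetical in-memory stand-ins for data sets that a real pipeline
# would pull from a database, a distributed file system, or a Kafka topic.
ORDERS = [
    {"order_id": 1, "user_id": 10, "amount": 25.0},
    {"order_id": 2, "user_id": 11, "amount": -5.0},
    {"order_id": 3, "user_id": 10, "amount": 40.0},
    {"order_id": 4, "user_id": 99, "amount": 12.0},
]
USERS = [
    {"user_id": 10, "country": "US"},
    {"user_id": 11, "country": "DE"},
]


def join_orders_with_users(orders, users):
    """Inner-join orders to users on user_id, adding the user's country."""
    users_by_id = {user["user_id"]: user for user in users}
    return [
        {**order, "country": users_by_id[order["user_id"]]["country"]}
        for order in orders
        if order["user_id"] in users_by_id
    ]


def filter_valid(rows):
    """Keep only rows with a positive amount."""
    return [row for row in rows if row["amount"] > 0]


def run_pipeline(orders, users):
    """Run the join step, then the filter step."""
    return filter_valid(join_orders_with_users(orders, users))


result = run_pipeline(ORDERS, USERS)
for row in result:
    print(row)
```

Each step silently drops rows here: the join discards the order whose user is unknown, and the filter discards the order with a negative amount. Losses like these are easy to miss without explicit checks on the data between steps.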
