Data Archives - Software Engineering Daily

Spark and Streaming with Matei Zaharia


Listen Later

Apache Spark is a system for processing large data sets in parallel. The core abstraction of Spark is the resilient distributed dataset (RDD), a working set of data that sits in memory for fast, iterative processing. Matei Zaharia created Spark with two goals: to provide a composable, high-level set of APIs for performing distributed processing;
...more
View all episodesView all episodes
Download on the App Store

Data Archives - Software Engineering DailyBy Data Archives - Software Engineering Daily

  • 4
  • 4
  • 4
  • 4
  • 4

4

28 ratings