Data Archives - Software Engineering Daily

Apache Arrow with Uwe Korn


Listen Later

In a typical data analytics system, there are a variety of technologies interacting. HDFS for storing files, Spark for distributed machine learning, pandas for data analysis in Python–each of these different technologies has a different format for how data is represented.   Serialization and deserialization between these different formats causes significant latency across the overall
...more
View all episodesView all episodes
Download on the App Store

Data Archives - Software Engineering DailyBy Data Archives - Software Engineering Daily

  • 4
  • 4
  • 4
  • 4
  • 4

4

28 ratings