Cloud Engineering Archives - Software Engineering Daily

Apache Beam with Frances Perry


Listen Later

Unbounded data streams create difficult challenges for our application architectures. The data never stops coming, and we are forced to assume that we will never know if or when we have seen all of our data. Some streaming systems give us the tools to deal partially with unbounded data streams, but we have to complement those streaming systems with batch processing, in a technique known as the Lambda Architecture.

Apache Beam is a unified model for defining and executing data processing workflows, and Frances Perry joins the show to explain how Beam provides a way for us to model our data processing, agnostic of whether we choose to run those workflows on Spark, Flink, or Google’s Dataflow.

Links
  • Apache Beam
  • Streaming 101
  • Streaming 102
  • The Dataflow Model
  • Google Cloud Dataflow
  • Fundamentals of Stream Processing with Beam
  • Mobile Gaming Example
  • Dataflow: Beam and Spark Comparison
  • The post Apache Beam with Frances Perry appeared first on Software Engineering Daily.

    ...more
    View all episodesView all episodes
    Download on the App Store

    Cloud Engineering Archives - Software Engineering DailyBy Cloud Engineering Archives - Software Engineering Daily

    • 3.8
    • 3.8
    • 3.8
    • 3.8
    • 3.8

    3.8

    31 ratings