Apache Beam

beam.apache.org

A unified model to define and execute data processing pipelines

Apache Beam provides a unified programming model for implementing batch and streaming data processing jobs that can run on any supported execution engine. It works with Apache Apex, Apache Flink, Apache Spark, and Google Cloud Dataflow, among other distributed processing back-ends.

Apache Beam grew out of several internal Google technologies, including MapReduce, FlumeJava, and MillWheel. Google donated the code to the Apache Software Foundation in 2016, and Googlers continue to contribute regularly to the project.

Apache, Apache Beam and the orange letter B logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries.

How Google uses Apache Beam

Apache Beam is the open source successor to the SDKs used internally to power the majority of Google’s massive-scale data processing pipelines.

Repo

beam

License

Apache 2.0

Categories

Cloud, Data processing, Operating systems

Languages

Java, Python
