Apache Beam provides an advanced unified programming model, allowing you to implement batch and streaming data processing jobs that can run on any execution engine. It is easy to use with Apache Apex, Apache Flink, Apache Spark, and Google Cloud Dataflow among other distributed processing back-ends.

Apache Beam developed out of a number of internal Google technologies, including MapReduce, FlumeJava, and Millwheel. Google donated the code to the Apache Software Foundation in 2016, and Googlers continue to contribute regularly to the project.

Apache, Apache Beam and the orange letter B logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries.

Apache Beam is an open source successor of the SDKs used internally to power the majority of Google’s massive-scale data processing pipelines.