Google Cloud Dataflow is a fully managed service for executing
Apache Beam
Apache Beam is an open source unified programming model to define and execute data processing pipelines, including ETL, batch and stream (continuous) processing. Beam Pipelines are defined using one of the provided SDKs and executed in one of ...
pipelines within the
Google Cloud Platform ecosystem.
History
Google Cloud Dataflow was announced in June, 2014 and released to the general public as an open beta in April, 2015. In January, 2016 Google donated the underlying
SDK, the implementation of a local runner, and a set of IOs (
data connectors) to access Google Cloud Platform data services to
the Apache Software Foundation
The Apache Software Foundation (ASF) is an American nonprofit corporation (classified as a 501(c)(3) organization in the United States) to support a number of open source software projects. The ASF was formed from a group of developers of the ...
.
The donated code formed the original basis for
Apache Beam
Apache Beam is an open source unified programming model to define and execute data processing pipelines, including ETL, batch and stream (continuous) processing. Beam Pipelines are defined using one of the provided SDKs and executed in one of ...
.
References
External links
*
Dataflow
Cloud computing
{{Google-stub