PDE - Ingesting and Processing Data - Section 2.2

Build data pipelines using Dataflow, Apache Beam, Dataproc, Cloud Data Fusion, BigQuery, Pub/Sub, Kafka, Spark, and Hadoop, including batch and streaming transformations with windowing and late-arriving data handling.

Build batch and streaming pipelines using Apache Beam on Dataflow, Spark and Hadoop on Dataproc, and Cloud Data Fusion for low-code integration, applying windowing strategies and late-data handling to produce correct aggregations across out-of-order event streams.

Apache BeamDataprocCloud Data FusionWindowingLate data

Build data pipelines using Dataflow, Apache Beam, Dataproc, Cloud Data Fusion, BigQuery, Pub/Sub, Kafka, Spark, and Hadoop, including batch and streaming transformations with windowing and late-arriving data handling.

More in this domain