PDE - Ingesting and Processing Data - Section 2.2

Build data pipelines using Dataflow, Apache Beam, Dataproc, Cloud Data Fusion, BigQuery, Pub/Sub, Kafka, Spark, and Hadoop, including batch and streaming transformations with windowing and late-arriving data handling.

Build batch and streaming pipelines using Apache Beam on Dataflow, Spark and Hadoop on Dataproc, and Cloud Data Fusion for low-code integration, applying windowing strategies and late-data handling to produce correct aggregations across out-of-order event streams.
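
The interaction between windowing and late data can be sketched in plain Python. This is a simplified illustration, not the Apache Beam API: the names `WINDOW_SIZE`, `ALLOWED_LATENESS`, `window_start`, and `aggregate` are hypothetical, and the watermark is assumed to be the largest event time seen so far, whereas real runners estimate it heuristically. Events are assigned to fixed windows by event time, and a late event is still counted as long as its window's end plus the allowed lateness has not fallen behind the watermark.

```python
from collections import defaultdict

WINDOW_SIZE = 60        # seconds per fixed window (illustrative value)
ALLOWED_LATENESS = 30   # seconds a window accepts data past its end (illustrative value)


def window_start(event_time: int) -> int:
    """Return the start of the fixed window that contains event_time."""
    return event_time - (event_time % WINDOW_SIZE)


def aggregate(events):
    """Sum values per fixed event-time window, dropping data that is too late.

    events: iterable of (event_time, value) pairs in arrival order.
    Assumption: the watermark is the largest event time seen so far.
    """
    sums = defaultdict(int)
    dropped = []
    watermark = 0
    for event_time, value in events:
        watermark = max(watermark, event_time)
        window_end = window_start(event_time) + WINDOW_SIZE
        # The window is closed once the watermark passes its end plus the lateness allowance.
        if window_end + ALLOWED_LATENESS <= watermark:
            dropped.append((event_time, value))
            continue
        sums[window_start(event_time)] += value
    return dict(sums), dropped


# Arrival order differs from event-time order: 58 arrives after 62, and 20 after 130.
events = [(5, 1), (62, 1), (58, 1), (130, 1), (20, 1)]
sums, dropped = aggregate(events)
print(sums)     # per-window sums keyed by window start
print(dropped)  # events that arrived after their window closed
```

In this run the event at time 58 arrives out of order but is still counted in its window, while the event at time 20 arrives after its window has closed and is dropped. Raising the lateness allowance trades more complete aggregates for holding window state longer.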

Apache Beam, Dataproc, Cloud Data Fusion, Windowing, Late data

Examworthy is not affiliated with or endorsed by Google Cloud. Original, blueprint-aligned practice material only.