Objectives in this domain

Develop end-to-end ML pipelines that validate data and models, orchestrate managed and unmanaged services from templates or custom solutions such as Agent Platform Pipelines, Managed Service for Apache Airflow, and Ray on the Agent Platform, and ensure consistent data preprocessing between training and serving.
Section 5.1hard
Automate model retraining by determining an appropriate retraining policy and deploying models in continuous integration, continuous delivery, and continuous training pipelines using services such as Cloud Build.
Section 5.2medium

Sample question from this domain

Free sampleAutomating and Orchestrating ML Pipelinesmedium

A bank runs a fraud-detection model whose accuracy decays as fraudsters change tactics, but fraud patterns shift unpredictably rather than on a fixed weekly or monthly cadence. The team wants retraining to fire only when the model is genuinely degrading, to avoid wasting compute on needless runs. Which retraining policy best fits this situation?

ASchedule a retraining run every night so the model is always built on the freshest labelled transactions available.
BRetrain manually whenever an analyst happens to notice that several fraud cases were missed during a review.
CTrigger retraining when monitored prediction performance or input drift crosses a defined threshold, so runs happen only on genuine degradation. Correct
DRetrain on a fixed monthly calendar schedule because most regulated banking processes operate on monthly reporting cycles.

Match the retraining trigger to the drift pattern: use performance or drift thresholds when degradation is unpredictable rather than periodic. Concept drift that arrives unpredictably is not captured by a calendar; tying retraining to a monitored performance or input-drift threshold makes the pipeline fire on actual degradation, conserving compute while keeping the model fresh exactly when it matters.

Why A is wrong: Nightly scheduling is tempting because fresh data sounds safer, but it ignores the stated goal of avoiding needless runs and retrains even when performance is stable, wasting compute.

Why B is wrong: Manual ad hoc retraining feels responsive, but it relies on human spotting, reacts slowly and inconsistently, and is not a defined automated policy tied to measured degradation.

Why C is correct: A performance- or drift-threshold trigger fires exactly when the model degrades, which matches unpredictable concept drift and the goal of retraining only when accuracy actually falls.

Why D is wrong: A monthly calendar feels orderly and aligns with reporting, but unpredictable drift can degrade the model mid-month, leaving it stale until the next fixed run arrives.

Other domains in this exam

Architecting Low-Code AI Solutions12% of the exam
Collaborating Within and Across Teams to Manage Data and Models16% of the exam
Scaling Prototypes Into ML Models21% of the exam
Serving and Scaling Models20% of the exam
Monitoring AI Solutions13% of the exam

See also the PMLE cert hub, the study guide, and the cheat sheet.