PMLE domain - 18% of the exam

Automating and Orchestrating ML Pipelines

Automating and Orchestrating ML Pipelines is 18% of the Google Cloud Professional Machine Learning Engineer (PMLE) exam. These are the objectives it covers, each with practice questions and worked explanations.

Objectives in this domain

Sample question from this domain

Free sampleAutomating and Orchestrating ML Pipelinesmedium

A bank runs a fraud-detection model whose accuracy decays as fraudsters change tactics, but fraud patterns shift unpredictably rather than on a fixed weekly or monthly cadence. The team wants retraining to fire only when the model is genuinely degrading, to avoid wasting compute on needless runs. Which retraining policy best fits this situation?

  • ASchedule a retraining run every night so the model is always built on the freshest labelled transactions available.
  • BRetrain manually whenever an analyst happens to notice that several fraud cases were missed during a review.
  • CTrigger retraining when monitored prediction performance or input drift crosses a defined threshold, so runs happen only on genuine degradation. Correct
  • DRetrain on a fixed monthly calendar schedule because most regulated banking processes operate on monthly reporting cycles.
Match the retraining trigger to the drift pattern: use performance or drift thresholds when degradation is unpredictable rather than periodic. Concept drift that arrives unpredictably is not captured by a calendar; tying retraining to a monitored performance or input-drift threshold makes the pipeline fire on actual degradation, conserving compute while keeping the model fresh exactly when it matters.

Why A is wrong: Nightly scheduling is tempting because fresh data sounds safer, but it ignores the stated goal of avoiding needless runs and retrains even when performance is stable, wasting compute.

Why B is wrong: Manual ad hoc retraining feels responsive, but it relies on human spotting, reacts slowly and inconsistently, and is not a defined automated policy tied to measured degradation.

Why C is correct: A performance- or drift-threshold trigger fires exactly when the model degrades, which matches unpredictable concept drift and the goal of retraining only when accuracy actually falls.

Why D is wrong: A monthly calendar feels orderly and aligns with reporting, but unpredictable drift can degrade the model mid-month, leaving it stale until the next fixed run arrives.

Other domains in this exam

See also the PMLE cert hub, the study guide, and the cheat sheet.

Examworthy is not affiliated with or endorsed by Google Cloud. Original, blueprint-aligned practice material only.