Objectives in this domain

Configure EC2 Auto Scaling groups and scaling policies to match compute capacity to demand using launch templates and health checks.
Section 2.1 (medium)
Scale managed databases and add caching using Amazon RDS and Aurora scaling, DynamoDB capacity modes, Amazon ElastiCache and Amazon CloudFront.
Section 2.2 (medium)
Configure and troubleshoot Elastic Load Balancing and Amazon Route 53 health checks for highly available endpoints.
Section 2.3 (medium)
Design fault-tolerant systems using Multi-AZ deployments, redundancy across Availability Zones and stateless design.
Section 2.4 (medium)
Automate snapshots and backups for Amazon EC2, RDS, EBS, S3 and DynamoDB resources using AWS Backup, backup plans and backup vaults.
Section 2.5 (medium)
Restore databases and storage to meet recovery time and recovery point objectives using point-in-time restore and versioning.
Section 2.6 (medium)
Follow disaster recovery procedures and select a strategy across Regions from backup and restore through pilot light and warm standby.
Section 2.7 (hard)

Sample question from this domain

Free sample: Reliability and Business Continuity (medium)

A web tier runs in an EC2 Auto Scaling group, and operators want the group to keep average CPU utilisation across the fleet close to 50 percent, adding or removing instances automatically as traffic rises and falls throughout the day. They want the simplest policy that maintains this set point without having to define individual thresholds for each capacity step. Which scaling policy meets this requirement with the least ongoing tuning?

A. A simple scaling policy that adds two instances whenever a CPU alarm breaches and then waits for a cooldown before evaluating again.
B. A step scaling policy with several CPU alarm bands that each add a different number of instances as utilisation climbs higher.
C. A scheduled scaling action that sets desired capacity higher during the day and lower at night based on the usual traffic curve.
D. A target tracking scaling policy on the average CPU utilisation metric with the target value set to 50 percent. (Correct)

Use a target tracking scaling policy to hold a metric at a chosen set point with the least manual threshold tuning. Target tracking works like a thermostat: you name a metric and a target value, and Auto Scaling provisions and manages the underlying CloudWatch alarms, computing the capacity changes needed to keep the metric near the target. This removes the per-band alarm and step design that simple and step scaling require, which is why it is the lowest-maintenance fit for a stable CPU set point.
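
As a rough illustration, the request for such a policy could be assembled as shown here. This is a sketch under stated assumptions, not a definitive implementation: the group name `web-tier-asg`, the policy name `cpu-target-tracking` and the helper `build_target_tracking_policy` are hypothetical. The function only builds a parameter dictionary in the shape that boto3's `put_scaling_policy` call accepts and does not contact AWS.

```python
def build_target_tracking_policy(group_name, policy_name, target_cpu_percent):
    """Return keyword arguments for a target tracking policy on average CPU."""
    if not 0 < target_cpu_percent <= 100:
        raise ValueError("target CPU must be greater than 0 and at most 100")
    return {
        "AutoScalingGroupName": group_name,
        "PolicyName": policy_name,
        "PolicyType": "TargetTrackingScaling",
        "TargetTrackingConfiguration": {
            # Predefined metric for the group's average CPU utilisation.
            "PredefinedMetricSpecification": {
                "PredefinedMetricType": "ASGAverageCPUUtilization",
            },
            "TargetValue": float(target_cpu_percent),
        },
    }


# Hypothetical names, chosen only for this example.
params = build_target_tracking_policy("web-tier-asg", "cpu-target-tracking", 50)

# With credentials and a real group in place, this could be applied with:
#   import boto3
#   boto3.client("autoscaling").put_scaling_policy(**params)
```

Note that the request contains no alarm thresholds or step sizes: the only tunable value is the target itself.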

Why A is wrong: Simple scaling reacts to one alarm with a fixed change and a blocking cooldown, so it cannot hold a continuous set point and needs the team to hand-tune the threshold and step.

Why B is wrong: Step scaling reacts faster than simple scaling but still forces operators to design and maintain every alarm band and step, which is exactly the per-threshold tuning they want to avoid.

Why C is wrong: Scheduled scaling changes capacity on a clock and is blind to the live CPU metric, so it cannot track an actual utilisation set point as traffic varies unpredictably.

Why D is correct: Target tracking creates and manages the CloudWatch alarms for you and adjusts capacity to hold the metric at the chosen set point, which is the lowest-effort way to keep CPU near 50 percent.
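
To make the contrast between options B and D concrete, the sketch below compares a hand-maintained set of step bands with a proportional rule of the kind a thermostat-style policy implies. Both are simplified models for illustration only: the band boundaries and step sizes are invented, and the proportional formula (capacity scaled by the ratio of observed to target CPU) is an assumption, not the documented Auto Scaling algorithm.

```python
import math

# Hypothetical step bands: (lower CPU bound, inclusive; instances to add).
# With step scaling, operators design and maintain every entry here.
STEP_BANDS = [(60, 1), (75, 2), (90, 4)]


def step_scaling_adjustment(cpu_percent):
    """Return the instances to add for the highest band the CPU reaches."""
    adjustment = 0
    for lower_bound, instances in STEP_BANDS:
        if cpu_percent >= lower_bound:
            adjustment = instances
    return adjustment


def proportional_desired_capacity(current_capacity, cpu_percent, target_percent):
    """Estimate the capacity that would bring average CPU back to the target."""
    return max(1, math.ceil(current_capacity * cpu_percent / target_percent))


print(step_scaling_adjustment(80))               # 2 (falls in the 75-89 band)
print(proportional_desired_capacity(4, 80, 50))  # 7 (ceil of 6.4)
```

In this model the step approach needs a table that must be revisited whenever traffic patterns change, while the proportional rule needs only the target value.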

Other domains in this exam

Monitoring, Logging, Analysis, Remediation, and Performance Optimization: 22% of the exam
Deployment, Provisioning, and Automation: 22% of the exam
Security and Compliance: 16% of the exam
Networking and Content Delivery: 18% of the exam

See also the SOA-C03 cert hub, the study guide, and the cheat sheet.