Objectives in this domain

Design loosely coupled architectures using Amazon SQS, Amazon SNS and Amazon EventBridge to decouple components.
Section 2.1 (medium)
Design scalable application integration using Amazon API Gateway, load balancers and container orchestration.
Section 2.2 (medium)
Design elastic capacity using EC2 Auto Scaling, scaling policies and serverless scaling to match demand.
Section 2.3 (medium)
Orchestrate decoupled, serverless workflows using AWS Step Functions, AWS Lambda and asynchronous fan-out patterns.
Section 2.4 (medium)
Design highly available architectures using Multi-AZ deployments, cross-Region replication and automatic failover.
Section 2.5 (hard)
Select disaster recovery strategies such as backup and restore, pilot light, warm standby and multi-site to meet RTO and RPO targets.
Section 2.6 (hard)
Design fault-tolerant compute and storage using load balancer health checks, redundancy and durable storage.
Section 2.7 (medium)

Sample question from this domain

Free sample: Design Resilient Architectures (medium)

An order-processing web tier writes directly to a fleet of EC2 worker instances over HTTP. During flash sales the workers are overwhelmed and requests are dropped, but at night the workers sit idle. The team wants to absorb traffic spikes, let the workers pull work at their own pace, and stop losing orders, with the least operational effort. Which change best meets these requirements?

A. Place an Application Load Balancer in front of the worker fleet and enable connection draining so that surplus order requests queue at the load balancer until a worker becomes available.
B. Send each order to an Amazon SQS standard queue and have the worker instances poll the queue, so messages persist until a worker is free to process them. (Correct)
C. Publish each order to an Amazon SNS topic and subscribe every worker instance so that all workers receive the same order and the fastest worker processes it first.
D. Route every order through an Amazon EventBridge bus with a rule that invokes the worker fleet directly, relying on EventBridge to retain orders the workers cannot yet accept.

Use an Amazon SQS queue to decouple a producer from consumers so that traffic spikes are buffered and work is pulled at the consumer's pace. SQS is a pull-based, durable message buffer. Producers enqueue messages that persist for the retention period, and consumers poll and delete them when processed, which absorbs bursts and decouples the tiers so no work is lost when consumers are saturated.
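The producer and consumer roles described above can be sketched in Python as follows. This is a minimal illustration, not a production worker. The keyword arguments and response keys follow the shape of the boto3 SQS client calls `send_message`, `receive_message` and `delete_message`, while `InMemoryQueue`, `enqueue_order`, `drain_once` and the `local://orders` URL are hypothetical names introduced here so the example runs without AWS credentials. The stand-in does not model the visibility timeout or at-least-once delivery of a real standard queue.

```python
import json


class InMemoryQueue:
    """Hypothetical local stand-in exposing the three SQS client calls used below.

    It ignores the visibility timeout, so a message that is received but not
    deleted is simply returned again on the next receive.
    """

    def __init__(self):
        self._messages = {}  # receipt handle -> message body
        self._next_id = 0

    def send_message(self, QueueUrl, MessageBody):
        self._next_id += 1
        self._messages[str(self._next_id)] = MessageBody
        return {"MessageId": str(self._next_id)}

    def receive_message(self, QueueUrl, MaxNumberOfMessages=1, WaitTimeSeconds=0):
        batch = list(self._messages.items())[:MaxNumberOfMessages]
        if not batch:
            return {}  # like SQS, there is no "Messages" key when nothing is available
        return {"Messages": [{"ReceiptHandle": h, "Body": b} for h, b in batch]}

    def delete_message(self, QueueUrl, ReceiptHandle):
        self._messages.pop(ReceiptHandle, None)
        return {}


def enqueue_order(sqs, queue_url, order):
    """Producer side: the web tier hands the order to the queue and returns."""
    sqs.send_message(QueueUrl=queue_url, MessageBody=json.dumps(order))


def drain_once(sqs, queue_url, handle_order, batch_size=10):
    """Consumer side: pull one batch, process it, delete each message on success.

    Returns the number of orders processed in this call.
    """
    response = sqs.receive_message(
        QueueUrl=queue_url,
        MaxNumberOfMessages=batch_size,  # SQS accepts 1 to 10 per call
        WaitTimeSeconds=20,              # long polling on a real queue
    )
    processed = 0
    for message in response.get("Messages", []):
        handle_order(json.loads(message["Body"]))
        # Delete only after the handler returns, so a failed order stays queued.
        sqs.delete_message(QueueUrl=queue_url, ReceiptHandle=message["ReceiptHandle"])
        processed += 1
    return processed


def demo():
    queue = InMemoryQueue()
    for i in range(25):  # a burst of 25 orders arrives at once
        enqueue_order(queue, "local://orders", {"order_id": i})
    handled = []
    while drain_once(queue, "local://orders", handled.append):
        pass  # the worker keeps pulling at its own pace until the queue is empty
    return len(handled)


if __name__ == "__main__":
    print(demo())
```

Against a real queue, the same two functions could be given `boto3.client("sqs")` and the queue's URL in place of the stand-in. Because a standard queue can deliver a message more than once, the order handler should be safe to run twice for the same order.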

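Why A is wrong: An ALB distributes synchronous requests but does not durably buffer them; when no healthy target can respond the requests time out, so orders are still lost during a spike. Connection draining (deregistration delay) only lets in-flight requests finish on targets that are being deregistered; it does not queue new requests.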

Why B is correct: An SQS queue durably buffers messages and lets consumers poll at their own rate, smoothing spikes and preventing dropped orders with minimal operational effort.

Why C is wrong: SNS pushes a copy to every subscriber, so all workers would process the same order, and it does not buffer messages for slow consumers to pull later.

Why D is wrong: EventBridge routes and filters events to targets but is built for push-style delivery, not for letting a worker pool pull buffered work at its own pace.
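The difference between the options can be made concrete with a toy model. The sketch below is an illustration under simplifying assumptions, not a description of how the AWS services behave internally: time is divided into ticks, the worker fleet handles a fixed number of orders per tick, direct push drops anything above that capacity, fan-out delivers one copy per subscriber, and the queue is assumed to drain well within its retention period with no duplicate deliveries. The function names and the traffic figures are hypothetical.

```python
def direct_push(arrivals, capacity_per_tick):
    """Synchronous push with no buffer: orders above capacity in a tick are dropped."""
    processed = dropped = 0
    for arriving in arrivals:
        accepted = min(arriving, capacity_per_tick)
        processed += accepted
        dropped += arriving - accepted
    return {"processed": processed, "dropped": dropped}


def fan_out(arrivals, workers):
    """Topic-style push: every subscribed worker receives its own copy of each order."""
    total = sum(arrivals)
    return {"orders": total, "deliveries": total * workers}


def queue_buffered(arrivals, capacity_per_tick):
    """Pull from a buffer: unprocessed orders carry over to the next tick."""
    if capacity_per_tick <= 0:
        raise ValueError("capacity_per_tick must be positive")
    backlog = processed = peak_backlog = ticks = 0
    remaining = list(arrivals)
    while remaining or backlog:
        if remaining:
            backlog += remaining.pop(0)  # new orders join the backlog
        peak_backlog = max(peak_backlog, backlog)
        done = min(backlog, capacity_per_tick)  # workers pull what they can handle
        backlog -= done
        processed += done
        ticks += 1
    return {"processed": processed, "peak_backlog": peak_backlog, "ticks": ticks}


if __name__ == "__main__":
    flash_sale = [5, 40, 40, 5, 0, 0]  # orders arriving per tick (made-up numbers)
    print(direct_push(flash_sale, capacity_per_tick=20))
    print(fan_out(flash_sale, workers=4))
    print(queue_buffered(flash_sale, capacity_per_tick=20))
```

With these made-up numbers, direct push processes 50 of the 90 orders and drops 40, fan-out to four workers produces 360 deliveries for 90 orders, and the buffered queue processes all 90 orders in 6 ticks with a peak backlog of 60.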

Other domains in this exam

Design Secure Architectures: 30% of the exam
Design High-Performing Architectures: 24% of the exam
Design Cost-Optimized Architectures: 20% of the exam

See also the SAA-C03 cert hub, the study guide, and the cheat sheet.