AWS 9.4 Reliability Flashcards

(5 cards)

1
Q

A system runs on a single large EC2 instance and crashes during peak demand. Which reliability principles are violated and how should the architecture be improved?

A

Violated:
• Scale horizontally
• Avoid single points of failure

Fix:
• Replace the single large instance with multiple smaller instances
• Use load balancing + Auto Scaling
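
The fix above can be sketched in a few lines of Python. This is a toy in-memory model, not AWS code: the `Instance` and `RoundRobinBalancer` classes are hypothetical names invented for illustration, and a real deployment would rely on Elastic Load Balancing health checks instead. It only shows why several small instances behind a balancer survive the loss of one, while a single instance does not.

```python
class Instance:
    """Hypothetical stand-in for one small server."""

    def __init__(self, name):
        self.name = name
        self.healthy = True

    def handle(self, request):
        if not self.healthy:
            raise RuntimeError(f"{self.name} is down")
        return f"{self.name} served {request}"


class RoundRobinBalancer:
    """Rotates requests across instances and skips unhealthy ones."""

    def __init__(self, instances):
        self.instances = list(instances)
        self._next = 0  # index of the instance to try first

    def route(self, request):
        # Try each instance at most once, starting at the rotation pointer.
        for _ in range(len(self.instances)):
            inst = self.instances[self._next]
            self._next = (self._next + 1) % len(self.instances)
            if inst.healthy:
                return inst.handle(request)
        raise RuntimeError("no healthy instances")
```

With three instances, marking one unhealthy leaves the other two serving traffic. With one instance, the same failure makes `route` raise, which is the single point of failure the card describes.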

2
Q

A company has backup systems but has never tested them. During a failure, recovery does not work. Which principle is missing and why is it critical?

A

Test recovery procedures

It is critical because untested recovery plans often fail in real scenarios.
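
One way to picture a recovery test is a small restore drill. The sketch below is an assumption-laden illustration using local files only: `take_backup` and `restore_and_verify` are hypothetical helper names, and a real drill would restore from the actual backup service into an isolated environment. The point is that the restore is exercised and checked, not just the backup.

```python
import hashlib
import shutil
import tempfile
from pathlib import Path


def sha256(path):
    """Return the SHA-256 hex digest of a file's contents."""
    return hashlib.sha256(Path(path).read_bytes()).hexdigest()


def take_backup(source, backup_dir):
    """Copy `source` into `backup_dir` and record the digest of the original."""
    backup = Path(backup_dir) / (Path(source).name + ".bak")
    shutil.copy2(source, backup)
    return backup, sha256(source)


def restore_and_verify(backup, expected_digest):
    """Restore into a scratch directory and confirm the data matches."""
    with tempfile.TemporaryDirectory() as scratch:
        restored = Path(scratch) / "restored"
        shutil.copy2(backup, restored)
        return sha256(restored) == expected_digest
```

Running `restore_and_verify` on a schedule would surface a corrupted or unusable backup before a real failure does.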

3
Q

A system detects failure but requires manual intervention to recover. What reliability principle is missing and how should it be implemented?

A

Automatically recover from failure

Implement using:
• Monitoring + alarms
• Auto recovery mechanisms
• Self-healing systems
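
A self-healing loop can be sketched as a reconcile step that replaces failed instances without a human in the loop. The `Fleet` class below is a hypothetical in-memory model, not an AWS API; on AWS this role is typically played by an Auto Scaling group acting on health checks.

```python
class Fleet:
    """Toy model of a group that keeps `desired` healthy instances running."""

    def __init__(self, desired):
        self.desired = desired
        self._counter = 0
        self.instances = {}  # name -> True if passing health checks
        self.reconcile()     # launch the initial instances

    def _launch(self):
        self._counter += 1
        name = f"i-{self._counter}"
        self.instances[name] = True
        return name

    def mark_failed(self, name):
        """Simulate a monitoring alarm reporting a failed health check."""
        self.instances[name] = False

    def reconcile(self):
        """One control-loop pass: drop failed instances, refill to desired."""
        replaced = [n for n, ok in self.instances.items() if not ok]
        for name in replaced:
            del self.instances[name]
        while len(self.instances) < self.desired:
            self._launch()
        return replaced
```

Calling `reconcile()` periodically, or whenever an alarm fires, removes the manual intervention step the card describes.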

4
Q

A company provisions fixed capacity based on estimates and frequently over/under-provisions resources. What principle fixes this and how?

A

Stop guessing capacity

Use:
• Monitoring metrics
• Auto Scaling to dynamically adjust resources
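
As a rough illustration of metric-driven scaling, the sketch below computes a new instance count from an observed metric and a target value. It uses a simple proportional rule chosen for illustration; it is not the algorithm AWS Auto Scaling uses internally, and `desired_capacity` with its `min_size` and `max_size` defaults is a hypothetical helper.

```python
import math


def desired_capacity(current, metric, target, min_size=1, max_size=10):
    """Scale the instance count so the per-instance metric approaches `target`.

    current: instances running now
    metric:  observed average per instance (for example, CPU percent)
    target:  value the average should settle at
    """
    if current < 1 or target <= 0:
        raise ValueError("current must be >= 1 and target must be > 0")
    raw = math.ceil(current * metric / target)
    # Clamp to the configured bounds so the group never scales to zero
    # or grows without limit.
    return max(min_size, min(max_size, raw))
```

For example, 4 instances averaging 90% CPU against a 60% target give `desired_capacity(4, 90, 60) == 6`, while the same fleet at 30% gives 2.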

5
Q

A system handles failures well but breaks whenever updates are deployed. Which reliability concept is being ignored?

A

Manage change in automation

Changes should be:
• Automated
• Controlled
• Tested before deployment
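
The three properties above can be combined in a small automated release gate. This is a minimal sketch under stated assumptions: `deploy`, `run_tests`, and `health_check` are hypothetical names, and the callables stand in for a real test suite and post-deployment monitoring.

```python
def deploy(current_version, new_version, run_tests, health_check):
    """Return (active_version, status) after an automated, gated rollout.

    run_tests:    callable(version) -> bool, run before anything is released
    health_check: callable(version) -> bool, run after the new version is live
    """
    # Tested before deployment: a failing build never reaches production.
    if not run_tests(new_version):
        return current_version, "rejected: tests failed"

    # Controlled: the release is kept only if it proves healthy;
    # otherwise the previous version stays active.
    if not health_check(new_version):
        return current_version, "rolled back: health check failed"

    return new_version, "deployed"
```

Because every step is a function call instead of a manual action, the same checks run on every update, which is what keeps deployments from breaking an otherwise resilient system.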
