What is the role of an SRE?
What other roles do SREs usually collaborate closely with?
Software Developers and System Administrators.
What principles does the SRE role work by?
CALMS: Culture, Automation, Lean, Measurement, Sharing.
SRE personnel can either be independent teams or embedded into cross-functional teams. When is each appropriate?
What is a benefit of having independent SRE teams?
Easier to share knowledge with other SRE teams across the organisation.
What is a benefit of having embedded SRE teams?
Less communication overhead when collaborating with the rest of the team.
Regarding the Culture principle, what are the 2 main aspects of ensuring good culture in an SRE team?
Blamelessness -> No finger pointing, culture of confidence and unity.
Shared Knowledge -> Tight communication loops and shared post-mortem reports.
What is a post-mortem report?
A log of an incident, the resulting impact, and the actions taken to resolve the issue.
Regarding the Automation principle, what are the 4 reasons this is important?
Automation helps:
- Eliminate toil tasks.
- Reduce human error.
- Complete tasks faster.
- Make results more reliable.
Define a toil task…
What type of tasks are ideal for automation?
Toil tasks
How many incidents should an SRE deal with per shift? Give reasons…
How can SREs implement the Lean culture principle?
In a Control Loop driven by an Error Budget, what happens if the Error Budget is positive or negative?
Positive: Developers can release more features into production.
Negative: Developers can’t release any more features into production.
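A minimal sketch of the control loop above, assuming the error budget is counted in failed requests against an availability SLO. The function names, the request-count framing, and the choice to treat a budget of exactly zero as a freeze are all illustrative assumptions, not part of the notes.

```python
def error_budget_remaining(slo_target: float, total_requests: int, failed_requests: int) -> float:
    """Return the unspent error budget, in requests (negative if overspent).

    slo_target is the availability objective as a fraction, e.g. 0.999.
    All parameter names here are hypothetical.
    """
    # The budget is the share of requests the SLO allows to fail.
    allowed_failures = (1.0 - slo_target) * total_requests
    return allowed_failures - failed_requests


def release_decision(budget_remaining: float) -> str:
    """Map the remaining budget onto the positive / negative rule."""
    # Assumption: a budget of exactly zero is treated like a negative one.
    return "release" if budget_remaining > 0 else "freeze"


if __name__ == "__main__":
    # 99.9% SLO over 100,000 requests allows about 100 failures.
    healthy = error_budget_remaining(0.999, 100_000, 40)
    overspent = error_budget_remaining(0.999, 100_000, 150)
    print(release_decision(healthy))
    print(release_decision(overspent))
```

With 40 failures the budget is still positive, so features can ship; with 150 failures the budget is negative, so releases stop until reliability recovers.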
Why is Measurement an important SRE principle?
What are the 3 core measurements for an SRE to monitor?
Why is Sharing an important SRE principle?