What is an SRE?
Site Reliability Engineer
Their approach to operations uses software as the primary tool for managing systems or improving the system (automation).
What is toil?
Toil is mundane and repetitive tasks that could be automated because it follows a strict pattern.
3 good practices for managing toil
1) Allocating strategic time to reducing/automating toil.
2) Do a cost benefit analysis to determine if automating the toil is worth it.
3) Keep a toil reduction backlog.
What are 3 responsibilities of an SRE?
1) Eliminating toil
2) Working to service levels
3) Managing failure
What is SLA?
Service Level Agreement
It is the target time for a service that you enter into contract with someone that uses that service.
What is SLO?
Service Level Objective
What is SLI?
Service Level Indicator
What are 3 ways that SRE differ from the traditional dev/ops split?