What is SRE?
Site Reliability Engineering
A practice that works alongside DevOps and aims to minimise the effect of software changes on end users.
It does this by automating as much of the IT infrastructure as possible, so that software stays reliable and stable for end users when updates or crashes occur.
Define what is meant by Site Reliability
The stability and reliability of software in production. Infrastructure should be in place to shield end users from the effects of updates, crashes, etc.
Why is SRE important?
How does SRE improve collaboration between development and operations?
The goal is to provide an IT infrastructure that minimises the effect of production changes on end users.
Development writes updates, fixes errors, etc. Operations deploys the software to production in a stable way.
Constant collaboration between the two is required to maintain stability and drive continuous improvement.
What is DevOps?
Define the traditional responsibilities of development and operations positions
Development deals with the production of code.
Operations deals with deploying and maintaining the software for end users.
What were the issues with the traditional waterfall handoff from development to operations?
How does a DevOps team operate?
What are the 5 DevOps principles? Explain each…
Culture: Collaboration and knowledge sharing are vital for effective DevOps.
Automate: Automating as much as possible reduces toil and human error, and speeds up releases.
Lean: Removing waste (toil) and automating where possible gives tight feedback loops and quick, efficient releases.
Measure: Track metrics to enable automation and create user-centric software. Metrics enable self-evaluation.
Share: Sharing knowledge across teams and organisations ensures shared learning and prevents repeated mistakes.
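As a small illustration of the Measure principle, the sketch below tracks one user-centric metric (the share of requests that succeeded) and checks it against a target, which is one simple form of self-evaluation. The names ReleaseWindow, success_rate and meets_target, the 0.999 target and all the numbers are assumptions made up for this example, not part of any particular tool.

```python
from dataclasses import dataclass


@dataclass
class ReleaseWindow:
    """Request counts observed for one release (hypothetical data shape)."""
    name: str
    total_requests: int
    failed_requests: int


def success_rate(window: ReleaseWindow) -> float:
    """Fraction of requests that succeeded; 1.0 if there was no traffic."""
    if window.total_requests == 0:
        return 1.0
    return 1 - window.failed_requests / window.total_requests


def meets_target(window: ReleaseWindow, target: float) -> bool:
    """True when the observed success rate is at or above the target."""
    return success_rate(window) >= target


# Illustrative numbers only, not real measurements.
before = ReleaseWindow("v1.0", total_requests=10_000, failed_requests=5)
after = ReleaseWindow("v1.1", total_requests=10_000, failed_requests=250)

if __name__ == "__main__":
    target = 0.999  # assumed target, chosen for the example
    for window in (before, after):
        status = "ok" if meets_target(window, target) else "below target"
        print(f"{window.name}: {success_rate(window):.4f} ({status})")
```

Comparing the same metric before and after a release is one way a team could tell whether a change hurt end users; in practice the counts would come from monitoring rather than hard-coded values.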
What are the pros of DevOps?
What are the cons of DevOps?