SRE principles and practices
SRE definition and history / SRE and DevOps / SRE principles and practices
Service Level Objectives and Error Budgets
Understanding Service Level Objectives (SLOs) / Error Budgets / Error Budget Policies / Setting SLOs for an Organization
Reducing Toil
Understanding toil and why it is bad / Human and organizational opportunities to reduce toil
Monitoring and Service Level Indicators
Understanding Service Level Indicators (SLIs) and how they relate to Service Level Objectives (SLOs) / The monitoring landscape / Observability and setting measurable service objectives
SRE tools and automation
Automation defined / DevOps and SRE automation focus / Types of SRE automation / Tooling landscape overview / Introduction to AIOps / Observability / Progressive Deployments / Value Stream Management / Generative AI and Platform Engineering
Anti-fragility and learning from failure
The benefits of learning from failure / Anti-fragility defined / Shifting organizational balance / Fire drills / Chaos Engineering
Organizational impact of SRE
Why organizations embrace SRE / Patterns for SRE adoption / Organizational impact of SRE / SRE job description / Sustainable incident response / Blameless post-mortems / Scaling SRE
SRE, other frameworks, the future
SRE and DevOps, Agile, ITSM / The evolution of SRE / SRE specialty roles such as Network Reliability Engineering and Customer Reliability Engineering