Site Reliability Engineering
Culture
Getting Started with Site Reliability Engineering 109
updated 7y ago
Stephen Thorne, Google.pdf
A quick introduction to SRE principles
This GitHub repository provides a 'playground' for learning and experimenting with SRE principles.
Getting Started with Site Reliability Engineering 109
updated 7y ago
This is a PDF document found on GitHub, detailing how to get started with SRE. It's presented by Stephen Thorne.
Making operational work more visible
This guide from readme.com discusses how to make operational work more visible, a key aspect of SRE.
Reliability
Chaos Engineering resources 6.5k
updated 2y ago
A Google SRE explores GitHub reliability with BigQuery 31.7k
updated yesterday
The Production Environment at Google - Part 1 & Part 2
How we’re building a production readiness review process at Grafana Labs 72.8k
updated yesterday
Using Fault Injection Testing to Improve DoorDash Reliability