Salt is Cloudflare’s configuration management tool.
How do you find the root cause of a configuration management failure when you have a peak of hundreds of changes in 15 minutes on thousands of servers?
The result of this has been a reduction in the duration of software release delays, and an overall reduction in toilsome, repetitive triage for SRE.
Opeyemi Onikute, Menno Bezema, Nick Rhodes — Cloudflare
In this post, I’ll give a high-level overview of what Temporal offers users, the problems we were experiencing operating Spinnaker that motivated its initial adoption at Netflix, and how Temporal helped us reduce the number of transient deployment failures at Netflix from 4% to 0.0001%.
Jacob Meyers and Rob Zienert — Netflix
DrP provides an SDK that teams can use to define “analyzers” to perform investigations, plus post-processors to perform mitigations, notifications, and more.
Shubham Somani, Vanish Talwar, Madhura Parikh, Chinmay Gandhi — Meta
This article goes in detail on the ways the QA folks can reskill and map their responsibilities and skills to SRE practices.
Nidhi Sharma — DZone
“Correction of Error” is the name used by Amazon for their incident review processand there’s a lot to unpack there.
Lorin Hocshtein
In 2019, Charity Majors came down hard on deploy freezes with an article, Friday Deploy Freezes are Exactly Like Murdering Puppies.
This one takes a more moderate approach: maybe a deploy freeze is the right choice for your organization, but you should work to understand why rather than assuming.
Charity Majors
A piece defining the term “resilience”, with an especially interesting discussion of the inherent trade-off between efficiency and resiliency.
Uwe Friedrichsen
Honeycomb experienced a major, extended incident in December, and they published this (extensive!) interim report. Resolution required multiple days’ worth of engineering on new functionality and procedures related to Kafka. A theme of managing employees’ energy and resources is threaded throughout the report.
Honeycomb
