SRE Weekly Issue #399

Paper: How in the World Did We Ever Get into That Mode?

This summary of a research paper covers mode error and the dangers of adding features to a system in the form of modes, especially when the system can change modes on its own.

Fred Hebert (summary)
Dr. Nadine B. Sarter (original paper)

Post Mortem on Cloudflare Control Plane and Analytics Outage

Cloudflare suffered a power outage in one of the datacenters housing their control and data planes. The outage itself is intriguing, and in its aftermath, Cloudflare learned that their system wasn’t as HA as they thought.

Lots of great lessons here, and if you want more, they posted another incident writeup recently.

Matthew Prince — Cloudflare

Architecture Patterns : Command Query Responsibility Segregation (CQRS)

Separating write from read workloads can increase complexity but also open the door to greater scalability, as this article explains.

Pier-Jean Malandrino
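
To make the read/write split concrete, here is a minimal sketch of the idea, not code from the linked article. The names `WriteStore`, `ReadModel`, `handle_deposit`, and `balance_of` are hypothetical, and the in-process callback stands in for whatever event bus or replication channel a real system would use.

```python
from dataclasses import dataclass, field


@dataclass
class WriteStore:
    """Write side (hypothetical): validates commands and publishes change events."""
    balances: dict = field(default_factory=dict)
    subscribers: list = field(default_factory=list)

    def handle_deposit(self, account: str, amount: int) -> None:
        # Commands are validated before they change any state.
        if amount <= 0:
            raise ValueError("deposit must be positive")
        self.balances[account] = self.balances.get(account, 0) + amount
        event = {"type": "deposited", "account": account, "amount": amount}
        # Assumption: synchronous callbacks; real systems usually publish asynchronously.
        for notify in self.subscribers:
            notify(event)


@dataclass
class ReadModel:
    """Read side (hypothetical): a query-optimized view built only from events."""
    totals: dict = field(default_factory=dict)

    def apply(self, event: dict) -> None:
        account = event["account"]
        self.totals[account] = self.totals.get(account, 0) + event["amount"]

    def balance_of(self, account: str) -> int:
        # Queries never touch the write store.
        return self.totals.get(account, 0)


write_side = WriteStore()
read_side = ReadModel()
write_side.subscribers.append(read_side.apply)
write_side.handle_deposit("alice", 5)
write_side.handle_deposit("alice", 3)
print(read_side.balance_of("alice"))
```

The extra moving part (keeping the read model in sync) is where the added complexity comes from; the payoff is that each side can be scaled and stored independently.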

Load Shedding for High Traffic Systems

Covers four strategies for load shedding, with code examples:

Random Shedding
Priority-Based Shedding
Resource-Based Shedding
Node Isolation

Code Reliant
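
As a rough illustration of the first three strategies listed above, here is a small sketch. It is not the article's code: the `Request` type, the priority scale, and every threshold are assumptions chosen for the example. Node isolation is left out because it depends on the load balancer or orchestrator in use.

```python
import random
from dataclasses import dataclass


@dataclass
class Request:
    # Hypothetical scale: 0 is most important, larger numbers matter less.
    priority: int


def random_shed(drop_fraction: float, rng=random.random) -> bool:
    """Random shedding: drop roughly `drop_fraction` of all requests."""
    return rng() < drop_fraction


def priority_shed(request: Request, load: float) -> bool:
    """Priority-based shedding: drop less important traffic first as load (0.0-1.0) rises."""
    if load < 0.7:          # assumed threshold: below this, serve everything
        return False
    if load < 0.9:          # assumed threshold: shed only the lowest tiers
        return request.priority >= 2
    return request.priority >= 1  # near saturation: keep only priority 0


def resource_shed(cpu_utilization: float, queue_depth: int,
                  cpu_limit: float = 0.85, queue_limit: int = 100) -> bool:
    """Resource-based shedding: drop when a local resource signal crosses its limit."""
    return cpu_utilization > cpu_limit or queue_depth > queue_limit
```

Each function returns `True` when the request should be rejected, so a server could call one of them at the top of its request handler and return an error such as HTTP 503 early.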

Handling a Regional Outage: Comparing the Response From AWS, Azure and GCP

Lots of juicy details about the three outages, including a link to AWS’s write-up of their Lambda outage in June.

Gergely Orosz

Architecture Patterns : The Circuit-Breaker

The diagrams in this article are especially useful for understanding how the circuit-breaker pattern works.

Pier-Jean Malandrino
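
For readers who prefer code to diagrams, here is a minimal sketch of the usual closed / open / half-open state machine. It is not taken from the article; the class name, the default thresholds, and the injectable `clock` parameter are assumptions made for illustration.

```python
import time


class CircuitOpenError(Exception):
    """Raised when a call is rejected because the circuit is open."""


class CircuitBreaker:
    def __init__(self, failure_threshold=3, reset_timeout=30.0, clock=time.monotonic):
        self.failure_threshold = failure_threshold  # consecutive failures before opening
        self.reset_timeout = reset_timeout          # seconds to wait before a trial call
        self.clock = clock                          # injectable for testing
        self.failures = 0
        self.opened_at = None                       # None means the circuit is closed

    @property
    def state(self):
        if self.opened_at is None:
            return "closed"
        if self.clock() - self.opened_at >= self.reset_timeout:
            return "half-open"
        return "open"

    def call(self, func, *args, **kwargs):
        if self.state == "open":
            # Fail fast instead of sending more traffic to a struggling dependency.
            raise CircuitOpenError("circuit is open")
        try:
            result = func(*args, **kwargs)
        except Exception:
            self.failures += 1
            # A failed trial call in half-open, or too many failures, (re)opens the circuit.
            if self.state == "half-open" or self.failures >= self.failure_threshold:
                self.opened_at = self.clock()
            raise
        # Any success closes the circuit and resets the failure count.
        self.failures = 0
        self.opened_at = None
        return result
```

Wrapping each call to a remote dependency as `breaker.call(fetch, url)` (with `fetch` being whatever client function is in use) lets callers fail immediately while the dependency recovers, then probe it with a single trial call once the timeout has passed.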

How to be on-call

This one’s about how on-call can go bad, and how to structure your team’s on-call so that it is livable and sustainable.

Michael Hart

Working Effectively With Executives During an Incident

Execs cast a big shadow in an incident, so it’s important to have a plan for how to communicate with them, as this article explains.

Ashley Sawatsky — Rootly
