SRE Weekly Issue #383

Articles

This delightful talk explores what SRE can look like in practical terms by learning about the sociotechnical situation at a fictitious company. To do that, Amy Tobey plays a game she created, walking through a town and talking to NPCs.

Amy Tobey — InfoQ

Querying & Ingest outage [Honeycomb]

Honeycomb had a major outage last tuesday, and they posted this interim outage report on their status page.

Note: Honeycomb is my employer, and I proofread this article.

Honeycomb

The System Resiliency Pyramid

The system resiliency pyramid provides a holistic framework for thinking about reliability across five key layers.

I like the way this system of layers breaks down the multiple different aspects of reliability.

Code Reliant

Traffic Jams in the Cloud: Are Overloads Sabotaging Your Application’s Reliability?

This article explores system overload using a traffic congestion analogy. I especially like the note about failover as a cause of an overload condition.

Tanveer Gill — FluxNinja

Driving successful change: Understanding DORA’s Change Failure Rate metric

in this article, I’ll dive into this vital DORA metric, detail its benchmarks, and provide practical insights to help you drive more frequent successful changes.

incident.io

Slow Down! Rate Limiting Deep Dive

This article explains four different rate limiting algorithms and includes code snippets in Java.

Code Reliant

PostgreSQL: No More VACUUM, No More Bloat

PostgreSQL vacuuming can be a total pain — and a serious threat to performance and reliability. This new database engine sounds pretty interesting.

Oriole

Rethinking infrastructure as code from scratch

Current IaC tools are like plain HTML, says this author, and we should have something like CSS to avoid repeating ourselves.

Nathan Peck

10 Years of Failure Friday at PagerDuty: Fostering Resilience, Learning and Reliability

PagerDuty looks back on a decade of weekly chaos experiments and shares advice on starting your own similar program.

Cristina Dias — PagerDuty

SRE Weekly Issue #383

Articles

Subscribe

RSS

Mastodon

Search Issues

A message from our sponsor, Rootly:

Articles

Subscribe

RSS

Mastodon

Search Issues