This article series asks, do you really need ACID consistency?
Well, of course ACID consistency exists – and it is a good thing that it exists. Thus, feel free to call the post title clickbait … ;)
My point here is that it should not exist as functional requirement.
Uwe Friedrichsen
OpenAI posted this mini report on their outage on January 30.
OpenAI
It’s never DNS, except when it’s definitely DNS, such as in the case of this probable DNSSEC misconfiguration.
Wilson Chua — Manila Bulletin
Do you want to prioritize availability or control?
Teiva Harsanyi — The Coder Cafe
The amount of attention an incident gets is proportional to the severity of the incident: the greater the impact to the organization, the more attention that post-incident activities will get.
The problem is that the severity of a near-miss incident is zero, but it can have significant value for learning even still.
Lorin Hochstein
This article urges caution in creating alerts that recommend a specific course of action when they fire. It explains why this can be dangerous and suggests alternative methods.
Fred Hebert — Honeycomb
In this post, I will highlight some crucial Kubernetes best practices. They are from my years of experience with Kubernetes in production. Think of this as the curated “Kubernetes cheat sheet” you wish you had from Day 1.
Engin Diri — Pulumi
Meta’s profiling system has helped them save thousands of servers’ worth of computing resources, through continuous profiling and centralized symbolization.
Jordan Rome — Meta