Articles
If you’re gonna operate on a pile of computers numbering six figures or more all at once, having to type that number in is a way to make you pause and think about what you’re doing.
Rachel by the bay
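A minimal sketch of that kind of guard, not taken from the linked article: the names `confirm_bulk_operation`, `run_on_fleet`, and the `threshold` default are all hypothetical. The idea is that a plain "y" is not enough; the operator has to retype the exact host count before anything runs.

```python
def confirm_bulk_operation(target_count: int, typed: str) -> bool:
    """Return True only if the operator typed the exact number of targets."""
    # Allow surrounding whitespace and thousands separators such as "250,000".
    cleaned = typed.strip().replace(",", "")
    return cleaned.isdecimal() and int(cleaned) == target_count


def run_on_fleet(hosts, action, prompt=input, threshold=1000):
    """Apply `action` to every host, asking for the count first on large fleets.

    `threshold` is an assumed cutoff; pick whatever size should make you pause.
    Returns the number of hosts acted on (0 if the confirmation failed).
    """
    if len(hosts) >= threshold:
        typed = prompt(
            f"This will affect {len(hosts)} hosts. Type that number to continue: "
        )
        if not confirm_bulk_operation(len(hosts), typed):
            print("Count mismatch; aborting.")
            return 0
    for host in hosts:
        action(host)
    return len(hosts)
```

Passing `prompt` as a parameter keeps the check easy to exercise without a terminal.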
Find out why they decided to focus less on nines, and what they did instead.
Robert Sullivan
Reminds me of the classic:
It’s not DNS
There’s no way it’s DNS
It was DNS
(ssbroski on reddit)
Mike S.
Their front-end made duplicate calls to the new API to test load and response time prior to cutting over.
Michael P. Geraci, OkCupid
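The duplicate-call approach described above is often called shadow traffic. Here is a rough sketch of the idea, under the assumption that the old and new APIs can each be wrapped as a callable; `call_with_shadow` and its helpers are hypothetical names and this is not OkCupid's implementation. The caller is always served from the old API, while the new one receives a copy of the request whose latency and success are recorded and whose response is discarded.

```python
import time
from concurrent.futures import ThreadPoolExecutor

# Small background pool so shadow calls never block the real response.
_shadow_pool = ThreadPoolExecutor(max_workers=4)


def _timed_shadow_call(shadow, request, timings):
    """Call the new API, record (succeeded, seconds), and swallow any error."""
    start = time.perf_counter()
    try:
        shadow(request)
        ok = True
    except Exception:
        ok = False
    timings.append((ok, time.perf_counter() - start))


def call_with_shadow(primary, shadow, request, timings):
    """Serve `request` from `primary` while mirroring it to `shadow`.

    Returns the primary result plus a future for the shadow call, so callers
    (or tests) can wait for the measurement if they want to.
    """
    future = _shadow_pool.submit(_timed_shadow_call, shadow, request, timings)
    return primary(request), future
```

Because shadow failures are caught and only logged into `timings`, a broken new API cannot affect what users see during the test period.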
This is really cool. The researchers created a role-play scenario based on a real plane crash. They tried to get participants to blame “human error”, so that they could then surprise them with all of the (many) contributing factors that were involved.
Emily S. Patterson, Richard I. Cook, David D. Woods, Marta L. Render
Tips from one Sysadmin’s journey to becoming an SRE.
Josh Duffney, Octopus Deploy
Outages
- YouTube
- Macs
- Mac users had issues launching applications, owing to an outage of ocsp.apple.com. Apple confirmed the issue.
- PrometheusKube
- The link points to their awesome writeup of what went wrong and the on-the-fly reworking they had to do to fix it.
- Hotmail
- Various stock trading platforms
- There’s some speculation that this was a result of increased trading volume following Pfizer’s announcement about vaccine trial results.
- Robinhood
- Increased Error Rates