Articles
A nifty little pitfall in which an ionice
d process can block non-ionice
d processes.Author: rachelbythebay
Google published this free set of courses on technical writing. As an SRE, I have the constant need to write effectively to justify and document my designs.
Every engineer is also a writer.
This collection of courses and learning resources aims to improve your technical documentation. Learn how to plan and author technical documents.
The ACM has made their ACM Digital Library free to the public for the next 3 months. Many of their articles have been featured here previously.
Includes a great article by Jamie Woo, entitled Imagining Your Post-Incident Report As A Documentary.
Emil Stolarsky and Jaime Woo — The Post-Incident Review
Blameless recently had the privilege of hosting SRE leaders Liz Fong-Jones, Dave Rensin, and Alex Hidalgo to discuss how SREs can embrace resilience during pandemic, and how the principles of SRE intersect with global trends.
I especially liked the discussion of pent-up demand that may cause problems when we eventually get to relax social distancing.
Amy Tobey (moderator), Alex Hidalgo, Liz Fong-Jones, Dave Rensin
This is a talk that John Allspaw gave for Spotify.
Learning is not the same as fixing
John Allspaw — Adaptive Capacity Labs
Outages
- Google Cloud Platform
- This is an update to the outage included in last week’s issue, giving details on what went wrong. A problem with Cloud IAM affected many other GCP services.
- Let’s Encrypt
- GitHub
- Apple News
- Facebook, Instagram, WhatsApp
- Twitch
- GameStop
- Discord
- Includes a short description of what went wrong. Take it easy on yourselves, Discord folks, it happens to all of us. ♥