Articles
After an out-of-hours alert the responder gets the following 24 hours off from on-call. This helps with the social/health implications of being woken up multiple nights in a row.
Outages
- Bankwest
- Seacom
- Verizon
- Spark (New Zealand telecom)
- npm
-
NPM had an 8+ hour outage that left many build system providers such as Travis, CircleCI, and Heroku and individual users scrambling. Their postmortem indicates that a major end-of-day deploy the day before was to blame. Conversation in a related github issue suggests a monitoring gap and a lack of overnight on-call coverage for complex outages such as this one.
Full disclosure: Heroku, my employer, is mentioned.
-
- WhatsApp
-
This article alleges that the government of Zimbabwe cut access to WhatsApp to disrupt anti-government protests.
-
- Etsy
- Pokemon Go
-
Seems like approximately the entire internet is talking about this.
-
- Tinder
- Claro (Dominican Republic telecom
- ReachNow (car sharing service)
-
BMW is offering customers a $10 credit.
-
- Orange Poland (telecom)
- Pingdom
-
Pingdom had a series of outages in its API and UI this week. As a result, they are planning to create a status site after previously relying on Twitter to notify customers of outages.
-