Articles
Itās one thing to say you accept call-outs of unsafe situations ā itās another to actually do it. This cardiac surgeon shares what itās like when high reliability organizations get it wrong.
Robert Poston, MD
The game has been a victim of its own success, and the developers have had to put in quite a lot of work to deal with the load.
PezRadar ā Blizzard
This includes some lesser-known roles like Social Media Lead, Legal/Compliance Lead, and Partner Lead.
JJ Tang ā Rootly
This article is published by my sponsor, Rootly, but their sponsorship did not influence its inclusion in this issue.
There are a couple of great sections in this article, including āblamelessā retrospectives that arenāt actually blameless, and being judicious in which remediation actions you take.
Chris Evans ā incident.io
I love the idea that chaos monkey could actually be propping your infrastructure up. Oops.
Lorin Hochstein
I have to say, Iām really liking this DNS series.
Jan Schaumann
What? Why the heck am I including this here?
First, letās all keep in mind that this situation is still very much unfolding, and not much is concretely known about what happened. Itās also emotionally fraught, especially for the victims and their families, and my heart goes out to them.
The thing that caught my eye about this article is that this looks like a classic complex system failure. Thereās so much at play that led to this horrible accident, as outlined in this article and others, like this one (Julia Conley, Salon).
Aya Elamroussi, Chloe Melas and Claudia Dominguez ā CNN
Outages
I feel vindicated. I knew something was wrong with my search alert RSS feeds last week!Ā Putting SRE Weekly together without Google search alerts can beā¦ challenging.