SRE Weekly Issue #312

View on sreweekly.com

Articles

Aircraft CRASHES into Taxi and Bridge, WHAT happened?!

There’s a really great discussion of “pilot error” at the end of this air accident summary video.

Mentour Pilot

IRConf – The Incident Response Conference

There are some really great names and talks on the agenda for this half-day virtual conference on April 1.

IRConf

Building an SRE Career Progression Framework

This article is about building a framework, rather than using one off-the-shelf, to ensure that it’s tailored to the needs of your orgnaization.

Ethan Motion

What SREs Can Learn from Capt. Sully: When to Follow Playbooks

When are you smarter than your playbooks, and when are your playbooks smarter than you?

Andre King — Rootly
This article is published by my sponsor, Rootly, but their sponsorship did not influence its inclusion in this issue.

Incident Analysis 101: Putting It All Together

This one is about piecing together the story of how an incident unfolded. One interviewee might mention something new, and then you can ask later interviewees about it.

Cory Watson — Jeli

Alert Fatigue in SRE: What It Is & How To Avoid It

All about alert fatigue: how to recognize it and how to fix it once you notice it.

Emily Arnott — Blameless

GitHub Availability Report: February 2022

This one includes a summary of their February 2 outage:

[…] a routine deployment failed to generate the complete set of integrity hashes needed for Subresource Integrity. The resulting output was missing values needed to securely serve Javascript assets on GitHub.com.

Jakub Oleksy — GitHub

Postmortems Now Called Retrospectives in Blameless – Blameless

Following on last week’s article about the term “postmortem”, this one has even more great reasons to pick a different word.

Blameless

Incident Analysis 101: Writing up your findings

This article recommends a two-stage approach to writing an incident retrospective report: a “calibration document” and then the final report.

Thai Wood — Jeli

Outages

Tasmania
Discord

Something’s on fire! We’re looking into it, hang tight.

SRE WEEKLY

A message from our sponsor, Rootly:

Articles

Outages

Related