{"id":621,"date":"2022-08-15T01:04:37","date_gmt":"2022-08-15T01:04:37","guid":{"rendered":"https:\/\/fde.cat\/index.php\/2022\/08\/15\/sre-weekly-issue-334\/"},"modified":"2022-08-15T01:04:37","modified_gmt":"2022-08-15T01:04:37","slug":"sre-weekly-issue-334","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2022\/08\/15\/sre-weekly-issue-334\/","title":{"rendered":"SRE Weekly Issue #334"},"content":{"rendered":"<p><a href=\"https:\/\/sreweekly.com\/sre-weekly-issue-334\/\" title=\"Permalink to SRE Weekly Issue #334\" class=\"email_only\">View on sreweekly.com<\/a><\/p>\n<p>I\u2019ll be on vacation starting next Sunday (yay!). That means the next two issues will be prepared in advance, so there won\u2019t be an Outages section.<\/p>\n<div class=\"sreweekly-sponsor-message\">\n<h2>A message from our sponsor, <a href=\"https:\/\/rootly.com\/demo\/?utm_source=sreweekly\">Rootly<\/a>:<\/h2>\n<p>Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly lego set):<br \/><a href=\"https:\/\/rootly.com\/demo\/\">https:\/\/rootly.com\/demo\/<\/a><\/p>\n<\/div>\n<h2>Articles<\/h2>\n<div class=\"wp-block-group\">\n<div class=\"wp-block-group__inner-container\">\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/incident.io\/blog\/third-party-outages\" target=\"_blank\" rel=\"noopener\">Handling third-party provider outages<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Should you go multi-cloud?  What should you do during an incident involving a third-party dependency?  What about after?  Read this one for all that and more.<\/p>\n<p>\u00a0\u00a0Lisa Karlin Curtis \u2014 incident.io<br \/>\n<em>Full disclosure: Fastly, my employer, is mentioned.<\/em><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/surfingcomplexity.blog\/2022\/07\/24\/common-ground-breakdown-in-uvalde\/\" target=\"_blank\" rel=\"noopener\">Common ground breakdown in Uvalde<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>An introduction to the concept of common ground breakdown, using the Uvalde shooting in the US as a case study.<\/p>\n<p>\u00a0\u00a0Lorin Hochstein<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.reddit.com\/r\/sre\/comments\/wny2js\/how_do_you_handle_weekly_commitments_during_your\/\" target=\"_blank\" rel=\"noopener\">r\/sre \u2013 How do you handle weekly commitments during your on call rotation?<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>The comments section is full of some pretty great advice, including questions you can ask while interviewing to suss out whether the on-call culture is going to be livable.<\/p>\n<p>\u00a0\u00a0u\/dicksoutfoeharambe (and others) \u2014 reddit<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/jonstevenshall.medium.com\/lessons-from-the-tsb-failure-a-perfect-storm-of-waterfall-failures-4f4d2e789b35\" target=\"_blank\" rel=\"noopener\">Lessons  from the TSB failure: a perfect storm of waterfall failures<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>From the archives, this is an analysis of a report on the 2018 major outage at TSB Bank in the UK.<\/p>\n<p>\u00a0\u00a0Jon Stevens-Hall<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"http:\/\/brooker.co.za\/blog\/2022\/08\/11\/backoff.html\" target=\"_blank\" rel=\"noopener\">What is Backoff For?<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>You can determine whether backoff will actually help your system, and this article does a great job of telling you how.<\/p>\n<p>\u00a0\u00a0Marc Brooker<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/blog.danslimmon.com\/2019\/06\/24\/an-incident-command-training-handbook\/\" target=\"_blank\" rel=\"noopener\">An Incident Command Training Handbook<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>I\u2019ve read (and written) plenty of IC training guides, but this is the first time I\u2019ve come across the concept of a \u201cHands-Off Update\u201d.  I\u2019m definitely going to use that!<\/p>\n<p>\u00a0\u00a0Dan Slimmon<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/blog.danslimmon.com\/2019\/05\/03\/no-observability-without-theory\/\" target=\"_blank\" rel=\"noopener\">No observability without theory<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This is a really great exlpanation of observability from an angle I haven\u2019t seen before.<\/p>\n<p>a metric dashboard only contributes to observability if its reader can interpret the curves they\u2019re seeing within a theory of the system under study.<\/p>\n<p>\u00a0\u00a0Dan Slimmon<\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<h2>Outages<\/h2>\n<p><a href=\"https:\/\/www.hindustantimes.com\/technology\/we-fixed-it-twitter-rolls-back-internal-systems-change-after-service-outage-101660090890057.html\">Twitter<\/a><br \/>\n<a href=\"https:\/\/www.theguardian.com\/technology\/2022\/aug\/09\/google-outage-search-down\">Google Search<\/a><\/p>\n<p>Did you catch the Google search outage?  I\u2019ve never seen one like it \u2014 that\u2019s how rare they are.  Google shared a tidbit of information about what went wrong \u2014 and it wasn\u2019t the <a href=\"https:\/\/www.sfgate.com\/news\/article\/google-electrical-incident-injures-3-17360321.php\">datacenter explosion<\/a> folks speculated about.<\/p>\n<p><a href=\"https:\/\/status.onepeloton.com\/incidents\/b268rgl3r63s\">Peloton<\/a><br \/>\nSRE WEEKLY<\/p>","protected":false},"excerpt":{"rendered":"<p>View on sreweekly.com I\u2019ll be on vacation starting next Sunday (yay!). That means the next two issues will be prepared in advance, so there won\u2019t be an Outages section. A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2022\/08\/15\/sre-weekly-issue-334\/\">Continue reading <span class=\"screen-reader-text\">SRE Weekly Issue #334<\/span><\/a><\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[8],"tags":[],"class_list":["post-621","post","type-post","status-publish","format-standard","hentry","category-sre","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":543,"url":"https:\/\/fde.cat\/index.php\/2022\/02\/21\/sre-weekly-issue-310\/","url_meta":{"origin":621,"position":0},"title":"SRE Weekly Issue #310","date":"February 21, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":577,"url":"https:\/\/fde.cat\/index.php\/2022\/05\/16\/sre-weekly-issue-322\/","url_meta":{"origin":621,"position":1},"title":"SRE Weekly Issue #322","date":"May 16, 2022","format":false,"excerpt":"View on sreweekly.com Bit of a short issue this week. This morning, I stepped on my phone, crushing it mightily beneath my bootheel. Unfortunately a lot of my automation for reviewing articles is on there\u2026 thank goodness I have functioning backups. A message from our sponsor, Rootly: Manage incidents directly\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":579,"url":"https:\/\/fde.cat\/index.php\/2022\/05\/30\/sre-weekly-issue-324\/","url_meta":{"origin":621,"position":2},"title":"SRE Weekly Issue #324","date":"May 30, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly lego set): https:\/\/rootly.com\/demo\/\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":535,"url":"https:\/\/fde.cat\/index.php\/2022\/01\/24\/sre-weekly-issue-306\/","url_meta":{"origin":621,"position":3},"title":"SRE Weekly Issue #306","date":"January 24, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":540,"url":"https:\/\/fde.cat\/index.php\/2022\/02\/07\/sre-weekly-issue-308\/","url_meta":{"origin":621,"position":4},"title":"SRE Weekly Issue #308","date":"February 7, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":560,"url":"https:\/\/fde.cat\/index.php\/2022\/04\/04\/sre-weekly-issue-316\/","url_meta":{"origin":621,"position":5},"title":"SRE Weekly Issue #316","date":"April 4, 2022","format":false,"excerpt":"View on sreweekly.com I\u2019m on vacation, so I prepared this issue in advance. Practically speaking, that just means there\u2019s no Outages section this week. See you all next week! P.S. Okay, I know I said no outages, but I will say that I\u2019m keeping an eye on the Southwest Airlines\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/621","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=621"}],"version-history":[{"count":0,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/621\/revisions"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=621"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=621"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=621"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}