{"id":664,"date":"2022-12-26T21:01:20","date_gmt":"2022-12-26T21:01:20","guid":{"rendered":"https:\/\/fde.cat\/index.php\/2022\/12\/26\/sre-weekly-issue-353\/"},"modified":"2022-12-26T21:01:20","modified_gmt":"2022-12-26T21:01:20","slug":"sre-weekly-issue-353","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2022\/12\/26\/sre-weekly-issue-353\/","title":{"rendered":"SRE Weekly Issue #353"},"content":{"rendered":"<p><a href=\"https:\/\/sreweekly.com\/sre-weekly-issue-353\/\" title=\"Permalink to SRE Weekly Issue #353\" class=\"email_only\">View on sreweekly.com<\/a><\/p>\n<div class=\"sreweekly-sponsor-message\">\n<h2>A message from our sponsor, <a href=\"https:\/\/rootly.com\/demo\/?utm_source=sreweekly\">Rootly<\/a>:<\/h2>\n<p>Manage incidents directly from Slack with Rootly\u00a0\ud83d\ude92.<\/p>\n<p>Rootly automates manual tasks like creating an incident channel, Jira ticket and Zoom rooms, inviting responders, creating statuspage updates, postmortem timelines and more. 
Want to see why companies like Canva and Grammarly love us?:<\/p>\n<p><a href=\"https:\/\/rootly.com\/demo\/\">https:\/\/rootly.com\/demo\/<\/a><\/p>\n<\/div>\n<h2>Articles<\/h2>\n<div class=\"wp-block-group\">\n<div class=\"wp-block-group__inner-container\">\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.itprotoday.com\/it-operations\/what-does-future-hold-sre-roles\" target=\"_blank\" rel=\"noopener\">What Does the Future Hold for Role of SRE?<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This article contains:<\/p>\n<p>two reasons why site reliability engineers may be part of IT teams for years to come, and two reasons why site reliability engineering may turn out just to be a fad.<\/p>\n<p>\u00a0\u00a0<small>Christopher Tozzi \u2014 ITPro Today<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/fiberplane.dev\/blog\/best-practices-for-observability\/\" target=\"_blank\" rel=\"noopener\">Best practices for observability<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This article proposes an interesting method for incident investigation: constantly try to <em>disprove<\/em> your hypotheses to avoid confirmation bias.<\/p>\n<p>\u00a0\u00a0<small>Ivan Merill \u2014 Fiberplane<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/hackaday.com\/2015\/10\/26\/killed-by-a-machine-the-therac-25\/\" target=\"_blank\" rel=\"noopener\">Killed By A Machine: The Therac-25<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>How I\u2019ve managed to run this newsletter for almost 7 years without a single mention of the Therac-25 incidents is beyond me.  
Therac-25 is an important lesson for all of us as we design systems and analyze incidents.<\/p>\n<p>\u00a0\u00a0<small>Adam Fabio \u2014 Hackaday<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/wiki.secondlife.com\/wiki\/User:Poppy_Linden\/Network_Errors_and_Data_Loss_2008-01\" target=\"_blank\" rel=\"noopener\">Network Errors and Data Loss 2008-01<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Even though this happened 14 years ago, the cause is very much still relevant today.  If you have two bit-flips in the same TCP packet, the packet can still pass the checksum.<\/p>\n<p>\u00a0\u00a0<small>Poppy Linden \u2014 Linden Lab<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/blog.danslimmon.com\/2017\/10\/02\/what-makes-a-good-alert\/\" target=\"_blank\" rel=\"noopener\">What makes a good alert?<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This article proposes two criteria: Actionability and Investigability.<\/p>\n<p>\u00a0\u00a0<small>Dan Slimmon<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/incident.io\/blog\/intermittent-downtime\" target=\"_blank\" rel=\"noopener\">Intermittent downtime from repeated crashes<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This write-up chronicles an incident in which a poison pill message repeatedly crashed their Heroku app.<\/p>\n<p>\u00a0\u00a0<small>Lawrence Jones \u2014 incident.io<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/admiralcloudberg.medium.com\/the-words-not-spoken-the-crash-of-avianca-flight-052-c69145b326f2\" target=\"_blank\" rel=\"noopener\">The Words Not Spoken: The crash of Avianca flight 052<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Take this one with a grain of salt since there\u2019s a 
fair bit of counterfactual reasoning in the description.  Nevertheless there\u2019s a lot to learn from this and <a href=\"https:\/\/en.wikipedia.org\/wiki\/Avianca_Flight_052\">Wikipedia\u2019s article on the same accident<\/a>.<\/p>\n<p>\u00a0\u00a0<small>Admiral Cloudberg<\/small><\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<p>SRE WEEKLY<\/p>","protected":false},"excerpt":{"rendered":"<p>View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly\u00a0\ud83d\ude92. Rootly automates manual tasks like creating an incident channel, Jira ticket and Zoom rooms, inviting responders, creating statuspage updates, postmortem timelines and more. Want to see why companies like Canva and Grammarly love us?: https:\/\/rootly.com\/demo\/ Articles What Does the&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2022\/12\/26\/sre-weekly-issue-353\/\">Continue reading <span class=\"screen-reader-text\">SRE Weekly Issue #353<\/span><\/a><\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[8],"tags":[],"class_list":["post-664","post","type-post","status-publish","format-standard","hentry","category-sre","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":543,"url":"https:\/\/fde.cat\/index.php\/2022\/02\/21\/sre-weekly-issue-310\/","url_meta":{"origin":664,"position":0},"title":"SRE Weekly Issue #310","date":"February 21, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. 
Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":579,"url":"https:\/\/fde.cat\/index.php\/2022\/05\/30\/sre-weekly-issue-324\/","url_meta":{"origin":664,"position":1},"title":"SRE Weekly Issue #324","date":"May 30, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly lego set): https:\/\/rootly.com\/demo\/\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":535,"url":"https:\/\/fde.cat\/index.php\/2022\/01\/24\/sre-weekly-issue-306\/","url_meta":{"origin":664,"position":2},"title":"SRE Weekly Issue #306","date":"January 24, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":540,"url":"https:\/\/fde.cat\/index.php\/2022\/02\/07\/sre-weekly-issue-308\/","url_meta":{"origin":664,"position":3},"title":"SRE Weekly Issue #308","date":"February 7, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. 
Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":653,"url":"https:\/\/fde.cat\/index.php\/2022\/11\/21\/sre-weekly-issue-348\/","url_meta":{"origin":664,"position":4},"title":"SRE Weekly Issue #348","date":"November 21, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly\u00a0\ud83d\ude92. Rootly automates manual tasks like creating an incident channel, Jira ticket and Zoom rooms, inviting responders, creating statuspage updates, postmortem timelines and more. Want to see why companies like Canva and Grammarly love us?:\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":640,"url":"https:\/\/fde.cat\/index.php\/2022\/10\/17\/sre-weekly-issue-343\/","url_meta":{"origin":664,"position":5},"title":"SRE Weekly Issue #343","date":"October 17, 2022","format":false,"excerpt":"View on sreweekly.com Bit of a short one this week as I recover from my third bout of COVID. Fortunately, this is another relatively mild one (thank you, vaccine!). Good luck everyone, and get your boosters. A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly\u00a0\ud83d\ude92. 
Rootly\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/664","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=664"}],"version-history":[{"count":0,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/664\/revisions"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=664"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=664"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=664"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}