{"id":557,"date":"2022-03-28T01:15:25","date_gmt":"2022-03-28T01:15:25","guid":{"rendered":"https:\/\/fde.cat\/index.php\/2022\/03\/28\/sre-weekly-issue-315\/"},"modified":"2022-03-28T01:15:25","modified_gmt":"2022-03-28T01:15:25","slug":"sre-weekly-issue-315","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2022\/03\/28\/sre-weekly-issue-315\/","title":{"rendered":"SRE Weekly Issue #315"},"content":{"rendered":"<p><a href=\"https:\/\/sreweekly.com\/sre-weekly-issue-315\/\" title=\"Permalink to SRE Weekly Issue #315\" class=\"email_only\">View on sreweekly.com<\/a><\/p>\n<p>I\u2019m going on vacation, so I\u2019m going to prepare next week\u2019s issue in advance.  It\u2019ll look much like most issues, except there won\u2019t be an Outages section.  See you all in two weeks!<\/p>\n<div class=\"sreweekly-sponsor-message\">\n<h2>A message from our sponsor, <a href=\"https:\/\/rootly.com\/demo\/?utm_source=sreweekly\">Rootly<\/a>:<\/h2>\n<p>Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly lego set):<br \/><a href=\"https:\/\/rootly.com\/demo\/\">https:\/\/rootly.com\/demo\/<\/a><\/p>\n<\/div>\n<h2>Articles<\/h2>\n<div class=\"wp-block-group\">\n<div class=\"wp-block-group__inner-container\">\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.jeli.io\/blog\/incident-analysis-101-facilitating-a-learning-review-without-prior-interviews\/\" target=\"_blank\" rel=\"noopener\">Incident Analysis 101: Facilitating a Learning Review Without Prior Interviews<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>In the previous articles in this series, they described a process of interviewing incident responders before a full retrospective meeting. 
This one discusses what to do if you can\u2019t conduct those interviews, the particular challenges that brings, and how to deal with them.<\/p>\n<p>\u00a0\u00a0Emily Ruppe \u2014 Jeli<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/brooker.co.za\/blog\/2022\/02\/16\/circuit-breakers.html\" target=\"_blank\" rel=\"noopener\">Will circuit breakers solve my problems?<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Some interesting ideas on potential downsides of circuit breakers and how we might ameliorate them.<\/p>\n<p>\u00a0\u00a0Marc Brooker<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/github.blog\/2022-03-23-an-update-on-recent-service-disruptions\/\" target=\"_blank\" rel=\"noopener\">[GitHub] An update on recent service disruptions<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>GitHub has had a bit of a hard time lately.  Here\u2019s an update on what they\u2019re dealing with and how they\u2019re planning to address it.<\/p>\n<p>\u00a0\u00a0Keith Ballinger \u2014 GitHub<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.infoq.com\/articles\/mtt-metrics-incident-response\/\" target=\"_blank\" rel=\"noopener\">How to Best Use MTT* Metrics to Optimize Your Incident Response<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>All sorts of \u201cmean time to\u201d metrics, including 6(!) 
different MTTR metrics and how they might be used.<\/p>\n<p>\u00a0\u00a0Alex Ewerl\u00f6f \u2014 InfoQ<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.equalexperts.com\/wp-content\/uploads\/2022\/03\/YBIYRI_Playbook-4.pdf\" target=\"_blank\" rel=\"noopener\">You Build It You Run It Playbook<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This is a huge report of more than 100 pages on the benefits of a model in which development teams own the operation of their systems.  There\u2019s a lot in here, with carefully spelled-out pros\/cons and cost\/benefit analyses.  Need to convince someone?  Send them this.<\/p>\n<p>We\u2019ve written this playbook for CxOs, product managers, delivery managers, and operations managers.<\/p>\n<p>\u00a0\u00a0Bethan Timmins and Steve Smith \u2014 Equal Experts<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/medium.com\/in-the-hudl\/operation-jumbo-drop-how-sending-large-packets-broke-our-aws-network-ff5041fc7a09\" target=\"_blank\" rel=\"noopener\">Operation Jumbo Drop: How sending large packets broke our AWS network<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>It\u2019s easy to miss MTUs, until they sneak up on you and cause really confusing problems.<\/p>\n<p>\u00a0\u00a0Aaron Kalair \u2014 Hudl<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/incident.io\/blog\/fair-on-call-compensation\" target=\"_blank\" rel=\"noopener\">What\u2019s a fair compensation for being on-call?<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Should you compensate for on-call? How? 
I really want to see more articles about this, so send them my way if you see or write any.<\/p>\n<p>\u00a0\u00a0Chris Evans \u2014 Incident.io<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/blog.last9.io\/how-to-improve-on-call-experience\/\" target=\"_blank\" rel=\"noopener\">How to Improve On-Call Experience!<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Some good tips in this article, and I love the case studies.<\/p>\n<p>\u00a0\u00a0Prathamesh Sonpatki \u2014 Last9<\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<h2>Outages<\/h2>\n<p><a href=\"https:\/\/status.pagerduty.com\/incidents\/v3s70z5yrg45\">PagerDuty<\/a><br \/>\n<a href=\"https:\/\/www.cnn.com\/2022\/03\/21\/tech\/apple-outage\/index.html\">Apple App Store, Apple Music and iCloud<\/a><br \/>\n<a href=\"https:\/\/www.githubstatus.com\/incidents\/83lq7ftk19r5\">GitHub<\/a><\/p>\n<p>GitHub had several incidents this week.<\/p>\n<p><a href=\"https:\/\/domainincite.com\/27675-another-dnssec-screw-up-takes-down-thousands-of-au-domains\">.au TLD<\/a><\/p>\n<p>DNSSEC.<\/p>\n<p><a href=\"https:\/\/www.actionnetwork.com\/legal-online-sports-betting\/prominent-offshore-site-sportsbook-ag-offline\">Sportsbook.ag<\/a><\/p>","protected":false},"excerpt":{"rendered":"<p>View on sreweekly.com I\u2019m going on vacation, so I\u2019m going to prepare next week\u2019s issue in advance. It\u2019ll look much like most issues, except there won\u2019t be an Outages section. See you all in two weeks! A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. 
Automate manual admin tasks like&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2022\/03\/28\/sre-weekly-issue-315\/\">Continue reading <span class=\"screen-reader-text\">SRE Weekly Issue #315<\/span><\/a><\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[8],"tags":[],"class_list":["post-557","post","type-post","status-publish","format-standard","hentry","category-sre","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":579,"url":"https:\/\/fde.cat\/index.php\/2022\/05\/30\/sre-weekly-issue-324\/","url_meta":{"origin":557,"position":0},"title":"SRE Weekly Issue #324","date":"May 30, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly lego set): https:\/\/rootly.com\/demo\/\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":537,"url":"https:\/\/fde.cat\/index.php\/2022\/01\/31\/sre-weekly-issue-307\/","url_meta":{"origin":557,"position":1},"title":"SRE Weekly Issue #307","date":"January 31, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. 
Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":543,"url":"https:\/\/fde.cat\/index.php\/2022\/02\/21\/sre-weekly-issue-310\/","url_meta":{"origin":557,"position":2},"title":"SRE Weekly Issue #310","date":"February 21, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":663,"url":"https:\/\/fde.cat\/index.php\/2022\/12\/19\/sre-weekly-issue-352\/","url_meta":{"origin":557,"position":3},"title":"SRE Weekly Issue #352","date":"December 19, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly\u00a0\ud83d\ude92. Rootly automates manual tasks like creating an incident channel, Jira ticket and Zoom rooms, inviting responders, creating statuspage updates, postmortem timelines and more. Want to see why companies like Canva and Grammarly love us?:\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":535,"url":"https:\/\/fde.cat\/index.php\/2022\/01\/24\/sre-weekly-issue-306\/","url_meta":{"origin":557,"position":4},"title":"SRE Weekly Issue #306","date":"January 24, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. 
Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":509,"url":"https:\/\/fde.cat\/index.php\/2021\/11\/29\/sre-weekly-issue-298\/","url_meta":{"origin":557,"position":5},"title":"SRE Weekly Issue #298","date":"November 29, 2021","format":false,"excerpt":"View on sreweekly.com Email subscribers, my apologies for the double-send last week. I upgraded WordPress and subsequently further cemented my distrust of all version upgrades ever. I carefully tested a fix in staging before rolling it out gradually in preparation for this week\u2019s issue. Just kidding, I hacked on it\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/557","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=557"}],"version-history":[{"count":0,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/557\/revisions"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=557"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=557"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=557"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}