{"id":545,"date":"2022-02-28T02:43:34","date_gmt":"2022-02-28T02:43:34","guid":{"rendered":"https:\/\/fde.cat\/index.php\/2022\/02\/28\/sre-weekly-issue-311\/"},"modified":"2022-02-28T02:43:34","modified_gmt":"2022-02-28T02:43:34","slug":"sre-weekly-issue-311","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2022\/02\/28\/sre-weekly-issue-311\/","title":{"rendered":"SRE Weekly Issue #311"},"content":{"rendered":"<p><a href=\"https:\/\/sreweekly.com\/sre-weekly-issue-311\/\" title=\"Permalink to SRE Weekly Issue #311\" class=\"email_only\">View on sreweekly.com<\/a><\/p>\n<p>I\u2019m dedicating this issue to the people of Ukraine, and also those in Russia that are protesting the invasion.<\/p>\n<div class=\"sreweekly-sponsor-message\">\n<h2>A message from our sponsor, <a href=\"https:\/\/rootly.com\/demo\/?utm_source=sreweekly\">Rootly<\/a>:<\/h2>\n<p>Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt):<br \/><a href=\"https:\/\/rootly.com\/demo\/?utm_source=sreweekly\">https:\/\/rootly.com\/demo\/?utm_source=sreweekly<\/a><\/p>\n<\/div>\n<h2>Articles<\/h2>\n<div class=\"wp-block-group\">\n<div class=\"wp-block-group__inner-container\">\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.pageittothelimit.com\/easing-into-incident-command-with-iris-carrera\/\" target=\"_blank\" rel=\"noopener\">Easing Into Incident Command With Iris Carrera<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>In this episode of the podcast Page it to the Limit, they discuss learning how to be an incident commander.<\/p>\n<p>There was major AWS outage and the second day I was incident command.<\/p>\n<p>\u00a0\u00a0Kat Gaines, with guest Iris Carrera \u2014 Page it to the Limit<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/medium.com\/swlh\/the-ownership-trio-482a4e5f666d?source=rss-59d815140088------2\" target=\"_blank\" rel=\"noopener\">The ownership trio<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This article discusses three aspects of fully owning your systems: mandate, knowledge, and responsibility.  After defining those terms, it goes on to discuss what happens if one of the three is missing.<\/p>\n<p>\u00a0\u00a0Alex Ewerl\u00f6f<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/netflixtechblog.com\/rapid-event-notification-system-at-netflix-6deb1d2b57d1?source=rss----2615bd06b42e---4\" target=\"_blank\" rel=\"noopener\">Rapid Event Notification System at Netflix<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>I really like the \u201cManaging High RPS\u201d section, especially the part about ignoring events if they\u2019re too old to be relevant any longer.<\/p>\n<p>\u00a0\u00a0Ankush Gulati and David Gevorkyan \u2014 Netflix<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/engineering.linkedin.com\/blog\/2022\/hodor--detecting-and-addressing-overload-in-linkedin-microservic\" target=\"_blank\" rel=\"noopener\">Hodor: Detecting and addressing overload in LinkedIn microservices<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Cool idea!  When a process is overloaded, the system drops requests based on heuristics until the overload condition has passed.<\/p>\n<p>\u00a0\u00a0Bryan Barkley \u2014 LinkedIn<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/firehydrant.io\/blog\/incident-severity-and-priority-101\/\" target=\"_blank\" rel=\"noopener\">Incident severity and priority 101<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Here\u2019s another take on incident severity and priority levels.  The two terms are different and mean specific things.<\/p>\n<p>\u00a0\u00a0Robert Ross \u2014 FireHydrant<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/five9s.substack.com\/p\/renaming-sre-outage-post-mortems?utm_source=url\" target=\"_blank\" rel=\"noopener\">Renaming SRE outage \u201cpost-mortems\u201d for psychological safety<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Can we please agree to stop calling them \u201cpostmortems\u201d?<\/p>\n<p>\u00a0\u00a0Ash P \u2014 Cruform Newsletter<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/blog.last9.io\/the-origin-of-service-level-objectives\/\" target=\"_blank\" rel=\"noopener\">The origin of Service Level Objectives<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>The term \u201cservice level\u201d goes back to the US highway system maintenance procedures, among others.<\/p>\n<p>\u00a0\u00a0Akshay Chugh and Piyush Verma \u2014 Last9<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.honeycomb.io\/blog\/truth-about-meh-trics-metrics\/\" target=\"_blank\" rel=\"noopener\">The Truth About \u201cMEH-TRICS\u201d<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Charity Majors has railed against metrics for years.  Now, her company Honeycomb has a metrics product offering.  How does she square it?<\/p>\n<p>\u00a0\u00a0Charity Majors \u2014 Honeycomb<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/cloudpundit.com\/2022\/02\/22\/resilience-cloudy-without-a-chance-of-meatballs\/\" target=\"_blank\" rel=\"noopener\">Resilience: Cloudy without a chance of meatballs<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Despite the December AWS outage, folks aren\u2019t fleeing AWS, and multi-cloud designs for reliability still don\u2019t make sense, according to this cloud consultant.  The media angle is fascinating.<\/p>\n<p>\u00a0\u00a0Lydia Leong \u2014 Cloud Pundit<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.jeli.io\/incident-analysis-101-interviewing-how-to-determine-who-to-interview\/?utm_source=rss&amp;utm_medium=rss&amp;utm_campaign=incident-analysis-101-interviewing-how-to-determine-who-to-interview\" target=\"_blank\" rel=\"noopener\">Incident Analysis 101: Interviewing \u2013 How to determine who to interview<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This article has a great list of ideas of who to talk to, plus a section on how to prioritize when you\u2019re short on time.<\/p>\n<p>\u00a0\u00a0Daniela Hurtado \u2014 Jeli<\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<h2>Outages<\/h2>\n<p><a href=\"https:\/\/status.slack.com\/2022-02\/3083852db30af86f\">Slack<\/a><\/p>\n<p>They posted a followup with details on what happened.<\/p>\n<p>A configuration change inadvertently lead to a sudden increase in activity on our database infrastructure.<\/p>\n<p><a href=\"https:\/\/status.crates.io\/incidents\/1mqxfrx56qdh\">crates.io (Rust package repository)<\/a><br \/>\n<a href=\"https:\/\/www.thenationalnews.com\/world\/uk-news\/2022\/02\/25\/british-airways-suffers-second-computer-service-interruption-in-a-week\/\">British Airways<\/a><br \/>\n<a href=\"https:\/\/www.businessinsider.com\/donald-trump-social-media-app-truth-social-outage-minutes-launch-2022-2\">Truth Social<\/a><br \/>\n<a href=\"https:\/\/status.onepeloton.com\/incidents\/kxpr1mysktbb\">Peloton<\/a><br \/>\n<a href=\"https:\/\/truthsocial.statuspage.io\/\">Truth Social<\/a><\/p>\n<p>Due to the overwhelming demand at launch, we are currently rate-limited on onboarding new users to the platform.<\/p>\n<p>SRE WEEKLY<\/p>","protected":false},"excerpt":{"rendered":"<p>View on sreweekly.com I\u2019m dedicating this issue to the people of Ukraine, and also those in Russia that are protesting the invasion. A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2022\/02\/28\/sre-weekly-issue-311\/\">Continue reading <span class=\"screen-reader-text\">SRE Weekly Issue #311<\/span><\/a><\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[8],"tags":[],"class_list":["post-545","post","type-post","status-publish","format-standard","hentry","category-sre","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":543,"url":"https:\/\/fde.cat\/index.php\/2022\/02\/21\/sre-weekly-issue-310\/","url_meta":{"origin":545,"position":0},"title":"SRE Weekly Issue #310","date":"February 21, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":579,"url":"https:\/\/fde.cat\/index.php\/2022\/05\/30\/sre-weekly-issue-324\/","url_meta":{"origin":545,"position":1},"title":"SRE Weekly Issue #324","date":"May 30, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly lego set): https:\/\/rootly.com\/demo\/\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":535,"url":"https:\/\/fde.cat\/index.php\/2022\/01\/24\/sre-weekly-issue-306\/","url_meta":{"origin":545,"position":2},"title":"SRE Weekly Issue #306","date":"January 24, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":537,"url":"https:\/\/fde.cat\/index.php\/2022\/01\/31\/sre-weekly-issue-307\/","url_meta":{"origin":545,"position":3},"title":"SRE Weekly Issue #307","date":"January 31, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":546,"url":"https:\/\/fde.cat\/index.php\/2022\/03\/07\/sre-weekly-issue-312\/","url_meta":{"origin":545,"position":4},"title":"SRE Weekly Issue #312","date":"March 7, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":603,"url":"https:\/\/fde.cat\/index.php\/2022\/07\/04\/sre-weekly-issue-329\/","url_meta":{"origin":545,"position":5},"title":"SRE Weekly Issue #329","date":"July 4, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly lego set): https:\/\/rootly.com\/demo\/\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/545","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=545"}],"version-history":[{"count":0,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/545\/revisions"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=545"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=545"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=545"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}