{"id":579,"date":"2022-05-30T01:43:37","date_gmt":"2022-05-30T01:43:37","guid":{"rendered":"https:\/\/fde.cat\/index.php\/2022\/05\/30\/sre-weekly-issue-324\/"},"modified":"2022-05-30T01:43:37","modified_gmt":"2022-05-30T01:43:37","slug":"sre-weekly-issue-324","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2022\/05\/30\/sre-weekly-issue-324\/","title":{"rendered":"SRE Weekly Issue #324"},"content":{"rendered":"<p><a href=\"https:\/\/sreweekly.com\/sre-weekly-issue-324\/\" title=\"Permalink to SRE Weekly Issue #324\" class=\"email_only\">View on sreweekly.com<\/a><\/p>\n<div class=\"sreweekly-sponsor-message\">\n<h2>A message from our sponsor, <a href=\"https:\/\/rootly.com\/demo\/?utm_source=sreweekly\">Rootly<\/a>:<\/h2>\n<p>Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly lego set):<br \/>\n<a href=\"https:\/\/rootly.com\/demo\/\">https:\/\/rootly.com\/demo\/<\/a><\/p>\n<\/div>\n<h2>Articles<\/h2>\n<div class=\"wp-block-group\">\n<div class=\"wp-block-group__inner-container\">\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/thenewstack.io\/the-need-to-decouple-human-error-from-incident-response\/\" target=\"_blank\" rel=\"noopener\">The Need to Decouple Human Error from Incident Response<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>We\u2019ll start off this week with a recap of a KubeCon talk that urges leaving the concept of \u201chuman error\u201d behind.<\/p>\n<p>\u00a0\u00a0Jennifer Riggins \u2014 The New Stack<br \/>\n\u00a0\u00a0Talk by Silvia Pina<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/rootly.com\/blog\/5-tips-if-you-re-the-1st-sre-hire-by-instacart-s-first-sre\" target=\"_blank\" rel=\"noopener\">5 Tips If You\u2019re the 1st SRE Hire by Instacart\u2019s First SRE<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Just to be clear, they\u2019re saying the tips are written by Instacart\u2019s first SRE \u2014 they\u2019re not tips aimed oddly specifically at the second Instacart SRE.  Good tips, too.<\/p>\n<p>\u00a0\u00a0Quentin Rousseau \u2014 Rootly<br \/>\n<em>This article is published by my sponsor, Rootly, but their sponsorship did not influence its inclusion in this issue.<\/em><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/utcc.utoronto.ca\/~cks\/space\/blog\/sysadmin\/HaveGeneralHealthMetric\" target=\"_blank\" rel=\"noopener\">Systems should expose a (simple) overall health metric as well as specifics<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This is a really good point, and well argued.  Then there\u2019s an amusing bit at the end about alerting on the number of WARNING-level log messages generated by the system as a proxy for overall health.<\/p>\n<p>\u00a0\u00a0Chris Siebenmann<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.honeycomb.io\/blog\/tracking-on-call-health\/\" target=\"_blank\" rel=\"noopener\">Tracking On-Call Health<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>In this post, I\u2019m going to expand on the values we\u2019re currently using at Honeycomb to monitor on-call health, why we think they\u2019re good, and some of the challenges we\u2019re still encountering.<\/p>\n<p>\u00a0\u00a0Fred Hebert \u2014 Honeycomb<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.pagerduty.com\/blog\/when-incident-response-requires-business-response-who-should-you-notify\/\" target=\"_blank\" rel=\"noopener\">When incident response requires business response, who should you notify?<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Internal and external communication are critical in an incident, second (perhaps) only to actually resolving the problem.  Read this article to learn about who you need to communicate with, how to talk to them, and how to prepare in advance.<\/p>\n<p>\u00a0\u00a0Hannah Culver \u2014 PagerDuty<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/firehydrant.com\/blog\/time-for-sre-hero-to-pass-the-ball-and-how-to-get-there\/\" target=\"_blank\" rel=\"noopener\">We can\u2019t all be Shaq: why it\u2019s time for the SRE hero to pass the ball and how to get there<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>If you\u2019re playing the hero role at your organization, you might be unintentionally masking the need for better incident management practices. <\/p>\n<p>\u00a0\u00a0Malcolm Preston \u2014 FireHydrant<\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<h2>Outages<\/h2>\n<p><a href=\"https:\/\/www.dynstatus.com\/incidents\/1xlbp98xr3y2\">Oracle Dyn<\/a><br \/>\n<a href=\"https:\/\/www.digitaltrends.com\/social-media\/instagram-is-experiencing-issues-and-appears-to-be-down\/\">Meta<\/a><br \/>\n<a href=\"https:\/\/www.the-sun.com\/news\/5441870\/starbucks-app-down-frustrated-coffee-drinkers\/\">Starbucks<\/a><br \/>\n<a href=\"https:\/\/www.somersetlive.co.uk\/whats-on\/whats-on-news\/easyjet-passengers-warned-flight-delays-7131985\">easyJet<\/a><br \/>\n<a href=\"https:\/\/status.cloud.google.com\/incidents\/Gt6njQyniuxXViQULV2T\">Google BigQuery<\/a><br \/>\n<a href=\"https:\/\/www.githubstatus.com\/incidents\/zhtplv7zd052\">GitHub<\/a><br \/>\nSRE WEEKLY<\/p>","protected":false},"excerpt":{"rendered":"<p>View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly lego set): https:\/\/rootly.com\/demo\/ Articles The Need to Decouple&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2022\/05\/30\/sre-weekly-issue-324\/\">Continue reading <span class=\"screen-reader-text\">SRE Weekly Issue #324<\/span><\/a><\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[8],"tags":[],"class_list":["post-579","post","type-post","status-publish","format-standard","hentry","category-sre","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":543,"url":"https:\/\/fde.cat\/index.php\/2022\/02\/21\/sre-weekly-issue-310\/","url_meta":{"origin":579,"position":0},"title":"SRE Weekly Issue #310","date":"February 21, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":535,"url":"https:\/\/fde.cat\/index.php\/2022\/01\/24\/sre-weekly-issue-306\/","url_meta":{"origin":579,"position":1},"title":"SRE Weekly Issue #306","date":"January 24, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":546,"url":"https:\/\/fde.cat\/index.php\/2022\/03\/07\/sre-weekly-issue-312\/","url_meta":{"origin":579,"position":2},"title":"SRE Weekly Issue #312","date":"March 7, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":537,"url":"https:\/\/fde.cat\/index.php\/2022\/01\/31\/sre-weekly-issue-307\/","url_meta":{"origin":579,"position":3},"title":"SRE Weekly Issue #307","date":"January 31, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":603,"url":"https:\/\/fde.cat\/index.php\/2022\/07\/04\/sre-weekly-issue-329\/","url_meta":{"origin":579,"position":4},"title":"SRE Weekly Issue #329","date":"July 4, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly lego set): https:\/\/rootly.com\/demo\/\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":521,"url":"https:\/\/fde.cat\/index.php\/2021\/12\/27\/sre-weekly-issue-302\/","url_meta":{"origin":579,"position":5},"title":"SRE Weekly Issue #302","date":"December 27, 2021","format":false,"excerpt":"View on sreweekly.com Happy holidays, for those that celebrate! I put this issue together in advance, so no Outages section this week. A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/579","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=579"}],"version-history":[{"count":0,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/579\/revisions"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=579"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=579"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=579"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}