{"id":617,"date":"2022-08-08T01:12:47","date_gmt":"2022-08-08T01:12:47","guid":{"rendered":"https:\/\/fde.cat\/index.php\/2022\/08\/08\/sre-weekly-issue-333\/"},"modified":"2022-08-08T01:12:47","modified_gmt":"2022-08-08T01:12:47","slug":"sre-weekly-issue-333","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2022\/08\/08\/sre-weekly-issue-333\/","title":{"rendered":"SRE Weekly Issue #333"},"content":{"rendered":"<p><a href=\"https:\/\/sreweekly.com\/sre-weekly-issue-333\/\" title=\"Permalink to SRE Weekly Issue #333\" class=\"email_only\">View on sreweekly.com<\/a><\/p>\n<div class=\"sreweekly-sponsor-message\">\n<h2>A message from our sponsor, <a href=\"https:\/\/rootly.com\/demo\/?utm_source=sreweekly\">Rootly<\/a>:<\/h2>\n<p>Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly lego set):<br \/>\n<a href=\"https:\/\/rootly.com\/demo\/\">https:\/\/rootly.com\/demo\/<\/a><\/p>\n<\/div>\n<h2>Articles<\/h2>\n<div class=\"wp-block-group\">\n<div class=\"wp-block-group__inner-container\">\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/metrist.io\/blog\/is-sre-just-ops-with-a-new-name\/\" target=\"_blank\" rel=\"noopener\">Is SRE Just Ops with a New Name?<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>They asked four people and got four answers that run the gamut.<\/p>\n<p>\u00a0\u00a0Jeff Martens \u2014 Metrist<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/medium.com\/airbnb-engineering\/incident-management-ae863dc5d47f\" target=\"_blank\" rel=\"noopener\">Automated Incident Management Through Slack<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>How Airbnb automates incident management in a world of complex, rapidly evolving ensemble of microservices.<\/p>\n<p>Includes an overview of their ChatOps system that would make for a great blueprint to build your own.<\/p>\n<p>\u00a0\u00a0Vlad Vassiliouk \u2014 Airbnb<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/jonstevenshall.medium.com\/dont-overcategorise-incidents-e5f275154090\" target=\"_blank\" rel=\"noopener\">Don\u2019t overcategorise incidents<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Rigidly categorizing incidents can cause problems, according to this article.<\/p>\n<p>From the customer\u2019s viewpoint\u2026 well why would they care what kind of technical classification it is being forced into?<\/p>\n<p>\u00a0\u00a0Jon Stevens-Hall<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/newrelic.com\/blog\/how-to-relic\/best-practices-for-alerts\" target=\"_blank\" rel=\"noopener\">Best Practices for Fixing Your Alerts<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Lots of great advice in this one.<\/p>\n<p>If no human needs to be involved, it\u2019s pure automation.<br \/>\nIf it doesn\u2019t need a response right now, it\u2019s a report.<br \/>\nIf the thing you\u2019re observing isn\u2019t a problem, it\u2019s a dashboard.<br \/>\nIf nothing actually needs to be done, you should delete it.<\/p>\n<p>\u00a0\u00a0 Leon Adato \u2014 New Relic<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/incident.io\/blog\/customer-focused-incident-response\" target=\"_blank\" rel=\"noopener\">Driving a customer-focused incident response process<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Using the recent Atlassian outage as a case study, this article explains the importance of communication during an incident, then goes over best practices.<\/p>\n<p>\u00a0\u00a0Martha Lambert \u2014 incident.io<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.blameless.com\/sre\/sre-from-theory-to-practice-whats-difficult-about-on-call-discussion\" target=\"_blank\" rel=\"noopener\">SRE: From Theory to Practice | What\u2019s difficult about on-call?<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>My favorite part about this is the advice to \u201clower the cost of being wrong\u201d.  Important in any case, but especially during incident response.<\/p>\n<p>\u00a0\u00a0Emily Arnott \u2014 Blameless<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/github.blog\/2022-08-03-github-availability-report-july-2022\/\" target=\"_blank\" rel=\"noopener\">GitHub Availability Report: July 2022<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>There are some interesting incidents in this issue: one involving DNS and another with an overload involving over-eager retries.<\/p>\n<p>\u00a0\u00a0Jakub Oleksy \u2014 GitHub<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.blameless.com\/sre\/sre-interview-questions\" target=\"_blank\" rel=\"noopener\">Top SRE Interview Questions You Should Know<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>A great read both for interviewers and interviewees.<\/p>\n<p>\u00a0\u00a0Myra Nizami \u2014 Blameless<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/semaphoreci.com\/blog\/bad-microservices\" target=\"_blank\" rel=\"noopener\">When Microservices Are a Bad Idea<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Their main advice is to avoid <em>starting<\/em> with a microservice architecture, and only transition to one after your monolith has matured and you have a good reason to do so.<\/p>\n<p>\u00a0\u00a0Tomas Fernandez and Dan Ackerson \u2014 semaphore<\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<h2>Outages<\/h2>\n<p><a href=\"https:\/\/u.today\/solana-suffers-minor-service-outage-as-thousands-of-wallets-get-drained\">Solana<\/a><br \/>\n<a href=\"https:\/\/twitter.com\/Paytm\/status\/1555458737766219776\">Paytm<\/a><br \/>\n<a href=\"https:\/\/www.bbc.com\/news\/uk-wales-62442127\">National Health Service (UK)<\/a><br \/>\n<a href=\"https:\/\/www.americanbanker.com\/payments\/news\/how-badly-was-bread-burned-by-its-online-credit-card-payment-glitch\">Bread<\/a><br \/>\n<a href=\"https:\/\/www.dreamhoststatus.com\/pages\/incident\/575f0f606826303142000510\/62ddab5ddb464e053416ad87\">DreamHost<\/a><br \/>\n<a href=\"https:\/\/www.independent.co.uk\/tech\/flightradar24-down-live-plane-tracking-site-goes-offline-b2136429.html\">Flightradar24<\/a><br \/>\n<a href=\"https:\/\/www.stackstatus.net\/incidents\/9cbac2e1-6bdc-41c9-b520-8875d3ce5050\">Stack Exchange<\/a><br \/>\nSRE WEEKLY<\/p>","protected":false},"excerpt":{"rendered":"<p>View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly lego set): https:\/\/rootly.com\/demo\/ Articles Is SRE Just Ops&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2022\/08\/08\/sre-weekly-issue-333\/\">Continue reading <span class=\"screen-reader-text\">SRE Weekly Issue #333<\/span><\/a><\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[8],"tags":[],"class_list":["post-617","post","type-post","status-publish","format-standard","hentry","category-sre","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":543,"url":"https:\/\/fde.cat\/index.php\/2022\/02\/21\/sre-weekly-issue-310\/","url_meta":{"origin":617,"position":0},"title":"SRE Weekly Issue #310","date":"February 21, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":579,"url":"https:\/\/fde.cat\/index.php\/2022\/05\/30\/sre-weekly-issue-324\/","url_meta":{"origin":617,"position":1},"title":"SRE Weekly Issue #324","date":"May 30, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly lego set): https:\/\/rootly.com\/demo\/\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":535,"url":"https:\/\/fde.cat\/index.php\/2022\/01\/24\/sre-weekly-issue-306\/","url_meta":{"origin":617,"position":2},"title":"SRE Weekly Issue #306","date":"January 24, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":489,"url":"https:\/\/fde.cat\/index.php\/2021\/10\/18\/sre-weekly-issue-292\/","url_meta":{"origin":617,"position":3},"title":"SRE Weekly Issue #292","date":"October 18, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo: https:\/\/rootly.io\/?utm_source=sreweekly Articles Four lessons every company should learn\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":540,"url":"https:\/\/fde.cat\/index.php\/2022\/02\/07\/sre-weekly-issue-308\/","url_meta":{"origin":617,"position":4},"title":"SRE Weekly Issue #308","date":"February 7, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":552,"url":"https:\/\/fde.cat\/index.php\/2022\/03\/14\/sre-weekly-issue-313\/","url_meta":{"origin":617,"position":5},"title":"SRE Weekly Issue #313","date":"March 14, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly lego set): https:\/\/rootly.com\/demo\/\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/617","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=617"}],"version-history":[{"count":0,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/617\/revisions"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=617"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=617"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=617"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}