{"id":711,"date":"2023-05-08T02:40:31","date_gmt":"2023-05-08T02:40:31","guid":{"rendered":"https:\/\/fde.cat\/index.php\/2023\/05\/08\/sre-weekly-issue-371\/"},"modified":"2023-05-08T02:40:31","modified_gmt":"2023-05-08T02:40:31","slug":"sre-weekly-issue-371","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2023\/05\/08\/sre-weekly-issue-371\/","title":{"rendered":"SRE Weekly Issue #371"},"content":{"rendered":"<p><a href=\"https:\/\/sreweekly.com\/sre-weekly-issue-371\/\" title=\"Permalink to SRE Weekly Issue #371\" class=\"email_only\">View on sreweekly.com<\/a><\/p>\n<div class=\"sreweekly-sponsor-message\">\n<h2>A message from our sponsor, <a href=\"https:\/\/rootly.com\/demo\/?utm_source=sreweekly\">Rootly<\/a>:<\/h2>\n<p>Rootly is hiring for a Sr. Developer Relations Advocate to continue helping more world-class companies like Figma, NVIDIA, Squarespace, accelerate their incident management journey. Looking for previous on-call engineers with a passion for making the world a more reliable place.\u00a0 Learn more:<\/p>\n<p><a href=\"https:\/\/rootly.com\/careers?gh_jid=4015888007\">https:\/\/rootly.com\/careers?gh_jid=4015888007<\/a><\/p>\n<\/div>\n<h2>Articles<\/h2>\n<div class=\"wp-block-group\">\n<div class=\"wp-block-group__inner-container\">\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/flyingbarron.medium.com\/is-there-such-a-thing-as-a-system-thats-too-reliable-9a367ba850ac\" target=\"_blank\" rel=\"noopener\">Is there such a thing as a system that\u2019s too reliable?<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>NASA chose to squeeze just a bit more science out of the Voyager spacecrafts\u2019 aging power supplies by sacrificing a layer of redundancy.  I love this so much, because it sounds just like the kinds of decisions we make during incidents.<\/p>\n<p>\u00a0\u00a0<small>Robert Barron \u2014 IBM<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.techtarget.com\/searchitoperations\/news\/366536578\/Observability-maven-cranky-about-AIOps-embraces-GPT\" target=\"_blank\" rel=\"noopener\">Observability maven \u2018cranky\u2019 about AIOps embraces GPT<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>I really debated about including this one, because I don\u2019t often include articles about new products, and Ii think especially critically when the the company in question is my employer.<\/p>\n<p>With all that in mind, I\u2019m including this one anyway because Charity Majors really put a fine point on exactly why I, too, am cranky about AIOps.<\/p>\n<p>\u00a0\u00a0<small>Beth Pariseau \u2014 TechTarget<\/small><br \/>\n\u00a0\u00a0<small><em>Full disclosure: Honeycomb, my employer, is mentioned.<\/em><\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/firehydrant.com\/blog\/assembly-time-is-where-you-have-the-most-control-of-an-incident\/\" target=\"_blank\" rel=\"noopener\">Assembly time is where you have the most control of an incident<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>The main reason that MTTR is a flawed metric is that the nature of each incident varies so wildly.  Time to assemble, though, is much closer to being under our control.<\/p>\n<p>\u00a0\u00a0<small>Robert Ross \u2014 FireHydrant<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/incident.io\/blog\/understanding-incident-trige\" target=\"_blank\" rel=\"noopener\">How to improve incident triaging for better organization-wide incident response<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>The folks at incident.io recommend being expansive in what is considered an incident and then using a defined process to find the real incidents, determine impact and priority, and assign to the right team for resolution.<\/p>\n<p>\u00a0\u00a0<small>Luis Gonzalez \u2014 incident.io<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/github.blog\/2023-05-03-github-availability-report-april-2023\/\" target=\"_blank\" rel=\"noopener\">GitHub Availability Report: April 2023<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>GitHub had some interesting incidents this time around, in several cases stemming from changes made with the intention of improving reliability.<\/p>\n<p>\u00a0\u00a0<small>Jakub Oleksy \u2014 GitHub<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/netflixtechblog.com\/migrating-critical-traffic-at-scale-with-no-downtime-part-1-ba1c7a1c7835\" target=\"_blank\" rel=\"noopener\">Migrating Critical Traffic At Scale with No Downtime \u2014 Part 1<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Netflix records and replays live traffic in a testbed environment in order to validate a migration plan before they ever impact real customers.<\/p>\n<p>\u00a0\u00a0<small>Shyam Gala, Javier Fernandez-Ivern, Anup Rokkam Pratap, and Devang Shah \u2014 Netflix<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.primevideotech.com\/video-streaming\/scaling-up-the-prime-video-audio-video-monitoring-service-and-reducing-costs-by-90\" target=\"_blank\" rel=\"noopener\">Scaling up the Prime Video audio\/video monitoring service and reducing costs by 90%<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>The move from a distributed microservices architecture to a monolith application helped achieve higher scale, resilience, and reduce costs.<\/p>\n<p>I\u2019ve seen this sentiment more frequently recently.  Are we at the cusp of a general shift away from microservices?<\/p>\n<p>\u00a0\u00a0<small>Marcin Kolny \u2014 Amazon Prime Video<\/small><\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<p>SRE WEEKLY<\/p>","protected":false},"excerpt":{"rendered":"<p>View on sreweekly.com A message from our sponsor, Rootly: Rootly is hiring for a Sr. Developer Relations Advocate to continue helping more world-class companies like Figma, NVIDIA, Squarespace, accelerate their incident management journey. Looking for previous on-call engineers with a passion for making the world a more reliable place.\u00a0 Learn more: https:\/\/rootly.com\/careers?gh_jid=4015888007 Articles Is there&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2023\/05\/08\/sre-weekly-issue-371\/\">Continue reading <span class=\"screen-reader-text\">SRE Weekly Issue #371<\/span><\/a><\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[8],"tags":[],"class_list":["post-711","post","type-post","status-publish","format-standard","hentry","category-sre","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":594,"url":"https:\/\/fde.cat\/index.php\/2022\/06\/06\/sre-weekly-issue-325\/","url_meta":{"origin":711,"position":0},"title":"SRE Weekly Issue #325","date":"June 6, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly lego set): https:\/\/rootly.com\/demo\/\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":489,"url":"https:\/\/fde.cat\/index.php\/2021\/10\/18\/sre-weekly-issue-292\/","url_meta":{"origin":711,"position":1},"title":"SRE Weekly Issue #292","date":"October 18, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo: https:\/\/rootly.io\/?utm_source=sreweekly Articles Four lessons every company should learn\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":653,"url":"https:\/\/fde.cat\/index.php\/2022\/11\/21\/sre-weekly-issue-348\/","url_meta":{"origin":711,"position":2},"title":"SRE Weekly Issue #348","date":"November 21, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly\u00a0\ud83d\ude92. Rootly automates manual tasks like creating an incident channel, Jira ticket and Zoom rooms, inviting responders, creating statuspage updates, postmortem timelines and more. Want to see why companies like Canva and Grammarly love us?:\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":543,"url":"https:\/\/fde.cat\/index.php\/2022\/02\/21\/sre-weekly-issue-310\/","url_meta":{"origin":711,"position":3},"title":"SRE Weekly Issue #310","date":"February 21, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":579,"url":"https:\/\/fde.cat\/index.php\/2022\/05\/30\/sre-weekly-issue-324\/","url_meta":{"origin":711,"position":4},"title":"SRE Weekly Issue #324","date":"May 30, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly lego set): https:\/\/rootly.com\/demo\/\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":720,"url":"https:\/\/fde.cat\/index.php\/2023\/05\/29\/sre-weekly-issue-374\/","url_meta":{"origin":711,"position":5},"title":"SRE Weekly Issue #374","date":"May 29, 2023","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Rootly is hiring for a Sr. Developer Relations Advocate to continue helping more world-class companies like Figma, NVIDIA, Squarespace, accelerate their incident management journey. Looking for previous on-call engineers with a passion for making the world a more reliable place. Learn\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/711","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=711"}],"version-history":[{"count":0,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/711\/revisions"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=711"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=711"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=711"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}