{"id":739,"date":"2023-07-30T15:22:23","date_gmt":"2023-07-30T15:22:23","guid":{"rendered":"https:\/\/fde.cat\/index.php\/2023\/07\/30\/sre-weekly-issue-383\/"},"modified":"2023-07-30T15:22:23","modified_gmt":"2023-07-30T15:22:23","slug":"sre-weekly-issue-383","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2023\/07\/30\/sre-weekly-issue-383\/","title":{"rendered":"SRE Weekly Issue #383"},"content":{"rendered":"<p><a href=\"https:\/\/sreweekly.com\/sre-weekly-issue-383\/\" title=\"Permalink to SRE Weekly Issue #383\" class=\"email_only\">View on sreweekly.com<\/a><\/p>\n<div class=\"sreweekly-sponsor-message\">\n<h2>A message from our sponsor, <a href=\"https:\/\/rootly.com\/demo\/?utm_source=sreweekly\">Rootly<\/a>:<\/h2>\n<p>Eliminate the anxiety around declaring an incident for nebulous problems by introducing a triage phase into your incident management process. Our latest blog posts dives into why the triage phase is so important, and how you can automate yours with Rootly.<\/p>\n<p>Read more on the Rootly blog:<br \/>\n<a href=\"https:\/\/rootly.com\/blog\/improve-visibility-and-capture-more-data-with-triage-incidents\">https:\/\/rootly.com\/blog\/improve-visibility-and-capture-more-data-with-triage-incidents<\/a><\/p>\n<\/div>\n<h2>Articles<\/h2>\n<div class=\"wp-block-group\">\n<div class=\"wp-block-group__inner-container\">\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.infoq.com\/presentations\/sre-scale-sociotechnical\/\" target=\"_blank\" rel=\"noopener\">The Endgame of SRE <\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This delightful talk explores what SRE can look like in practical terms by learning about the sociotechnical situation at a fictitious company.  To do that, Amy Tobey plays a game she created, walking through a town and talking to NPCs.<\/p>\n<p>\u00a0\u00a0<small>Amy Tobey \u2014 InfoQ<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/status.honeycomb.io\/incidents\/fp0zjwwpp8bw\" target=\"_blank\" rel=\"noopener\">Querying &amp; Ingest outage [Honeycomb]<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Honeycomb had a major outage last tuesday, and they posted this interim outage report on their status page.<\/p>\n<p>Note: Honeycomb is my employer, and I proofread this article.<\/p>\n<p>\u00a0\u00a0<small>Honeycomb<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.codereliant.io\/the-system-resiliency-pyramid\/\" target=\"_blank\" rel=\"noopener\">The System Resiliency Pyramid<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>The system resiliency pyramid provides a holistic framework for thinking about reliability across five key layers.<\/p>\n<p>I like the way this system of layers breaks down the multiple different aspects of reliability.<\/p>\n<p>\u00a0\u00a0<small>Code Reliant<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/blog.fluxninja.com\/blog\/traffic-jams-in-the-cloud-unveiling-the-true-enemy-of-reliability\" target=\"_blank\" rel=\"noopener\">Traffic Jams in the Cloud: Are Overloads Sabotaging Your Application\u2019s Reliability?<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This article explores system overload using a traffic congestion analogy.  I especially like the note about failover as a <em>cause<\/em> of an overload condition.<\/p>\n<p>\u00a0\u00a0<small>Tanveer Gill \u2014 FluxNinja<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/incident.io\/blog\/dora-metrics-change-failure-rate\" target=\"_blank\" rel=\"noopener\">Driving successful change: Understanding DORA\u2019s Change Failure Rate metric<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>in this article, I\u2019ll dive into this vital DORA metric, detail its benchmarks, and provide practical insights to help you drive more frequent successful changes.<\/p>\n<p>\u00a0\u00a0<small>incident.io<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.codereliant.io\/rate-limiting-deep-dive\/\" target=\"_blank\" rel=\"noopener\">Slow Down! Rate Limiting Deep Dive<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This article explains four different rate limiting algorithms and includes code snippets in Java.<\/p>\n<p>\u00a0\u00a0<small>Code Reliant<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.orioledata.com\/blog\/no-more-vacuum-in-postgresql\/\" target=\"_blank\" rel=\"noopener\">PostgreSQL: No More VACUUM, No More Bloat<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>PostgreSQL vacuuming can be a total pain \u2014 and a serious threat to performance and reliability.  This new database engine sounds pretty interesting.<\/p>\n<p>\u00a0\u00a0<small>Oriole<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/nathanpeck.com\/rethinking-infrastructure-as-code-from-scratch\/\" target=\"_blank\" rel=\"noopener\">Rethinking infrastructure as code from scratch<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Current IaC tools are like plain HTML, says this author, and we should have something like CSS to avoid repeating ourselves.<\/p>\n<p>\u00a0\u00a0<small>Nathan Peck<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.pagerduty.com\/blog\/10-years-of-failure-friday-at-pagerduty-fostering-resilience-learning-and-reliability\/\" target=\"_blank\" rel=\"noopener\">10 Years of Failure Friday at PagerDuty: Fostering Resilience, Learning and Reliability<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>PagerDuty looks back on a decade of weekly chaos experiments and shares advice on starting your own similar program.<\/p>\n<p>\u00a0\u00a0<small>Cristina Dias \u2014 PagerDuty<\/small><\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<p>SRE WEEKLY<\/p>","protected":false},"excerpt":{"rendered":"<p>View on sreweekly.com A message from our sponsor, Rootly: Eliminate the anxiety around declaring an incident for nebulous problems by introducing a triage phase into your incident management process. Our latest blog posts dives into why the triage phase is so important, and how you can automate yours with Rootly. Read more on the Rootly&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2023\/07\/30\/sre-weekly-issue-383\/\">Continue reading <span class=\"screen-reader-text\">SRE Weekly Issue #383<\/span><\/a><\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[8],"tags":[],"class_list":["post-739","post","type-post","status-publish","format-standard","hentry","category-sre","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":737,"url":"https:\/\/fde.cat\/index.php\/2023\/07\/23\/sre-weekly-issue-382\/","url_meta":{"origin":739,"position":0},"title":"SRE Weekly Issue #382","date":"July 23, 2023","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Eliminate the anxiety around declaring an incident for nebulous problems by introducing a triage phase into your incident management process. Our latest blog posts dives into why the triage phase is so important, and how you can automate yours with Rootly.\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":543,"url":"https:\/\/fde.cat\/index.php\/2022\/02\/21\/sre-weekly-issue-310\/","url_meta":{"origin":739,"position":1},"title":"SRE Weekly Issue #310","date":"February 21, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":746,"url":"https:\/\/fde.cat\/index.php\/2023\/08\/14\/sre-weekly-issue-385\/","url_meta":{"origin":739,"position":2},"title":"SRE Weekly Issue #385","date":"August 14, 2023","format":false,"excerpt":"View on sreweekly.com Many apologies to Matt Cooper at GitHub, who is the actual author of the article Scaling Merge-ort Across GitHub from last week. Sorry for the mis-credit, Matt! A message from our sponsor, Rootly: When incidents impact your customers, failing to communicate with them effectively can erode trust\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":535,"url":"https:\/\/fde.cat\/index.php\/2022\/01\/24\/sre-weekly-issue-306\/","url_meta":{"origin":739,"position":3},"title":"SRE Weekly Issue #306","date":"January 24, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":579,"url":"https:\/\/fde.cat\/index.php\/2022\/05\/30\/sre-weekly-issue-324\/","url_meta":{"origin":739,"position":4},"title":"SRE Weekly Issue #324","date":"May 30, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly lego set): https:\/\/rootly.com\/demo\/\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":546,"url":"https:\/\/fde.cat\/index.php\/2022\/03\/07\/sre-weekly-issue-312\/","url_meta":{"origin":739,"position":5},"title":"SRE Weekly Issue #312","date":"March 7, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/739","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=739"}],"version-history":[{"count":0,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/739\/revisions"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=739"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=739"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=739"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}