{"id":807,"date":"2023-12-25T03:30:10","date_gmt":"2023-12-25T03:30:10","guid":{"rendered":"https:\/\/fde.cat\/index.php\/2023\/12\/25\/sre-weekly-issue-404\/"},"modified":"2023-12-25T03:30:10","modified_gmt":"2023-12-25T03:30:10","slug":"sre-weekly-issue-404","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2023\/12\/25\/sre-weekly-issue-404\/","title":{"rendered":"SRE Weekly Issue #404"},"content":{"rendered":"<p><a href=\"https:\/\/sreweekly.com\/sre-weekly-issue-404\/\" title=\"Permalink to SRE Weekly Issue #404\" class=\"email_only\">View on sreweekly.com<\/a><\/p>\n<div class=\"sreweekly-sponsor-message\">\n<h2>A message from our sponsor, <a href=\"https:\/\/firehydrant.com\/\">FireHydrant<\/a>:<\/h2>\n<p>Looking to cozy up with a good read this week? Check out \u201cYour guide to better status pages.\u201d It\u2019s a mini masterclass on how to better communicate on your status pages.<br \/>\n<a href=\"https:\/\/firehydrant.com\/blog\/your-guide-to-better-incident-status-pages\/\">https:\/\/firehydrant.com\/blog\/your-guide-to-better-incident-status-pages\/<\/a><\/p>\n<\/div>\n<div class=\"wp-block-group is-layout-flow wp-block-group-is-layout-flow\">\n<div class=\"wp-block-group__inner-container\">\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/blog.alexewerlof.com\/p\/10x9\" target=\"_blank\" rel=\"noopener\">Rule of 10x per 9<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>For every 9 you add to SLO, you\u2019re making the system 10x more reliable but also 10x more expensive.<\/p>\n<p>\u00a0\u00a0<small>Alex Ewerl\u00f6f<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/rachelbythebay.com\/w\/2023\/12\/07\/header\/\" target=\"_blank\" rel=\"noopener\">Patching around a C++ crash with a little bit of Lua<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>In this incident story, the feature flags were served by the main 
application server.  When a new feature caused the server to crash, there was no way to flip the flag back off to stop the crashes.<\/p>\n<p>\u00a0\u00a0<small>rachelbythebay<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/humanisticsystems.com\/2023\/11\/11\/set-taxonomies-to-neutral\/\" target=\"_blank\" rel=\"noopener\">Set Taxonomies to Neutral<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>The author of a classification system for human error reflects 20 years later on the harm that such systems can cause by using deficit-based language.<\/p>\n<p>\u00a0\u00a0<small>Dr. Steven Shorrock<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.thevoid.community\/incident-detail?recordId=rec7OIQslY0aYVqjA\" target=\"_blank\" rel=\"noopener\">Post Mortem on VOID Report: Cloudflare Control Plane and Analytics Outage<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Here\u2019s Fred Hebert\u2019s analysis of Cloudflare\u2019s write-up of their incident on November 2.<\/p>\n<p>I\u2019m hoping they\u2019re going to do a more in-depth review.<\/p>\n<p>\u00a0\u00a0<small>Fred Hebert \u2014 VOID<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/odigos.io\/blog\/Integrating-manual-and-auto\" target=\"_blank\" rel=\"noopener\">Integrating manual with automatic instrumentation<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>In this post, we introduce a hybrid approach that seamlessly combines the precision of manual instrumentation with the comfort, efficiency, and performance of automatic instrumentation.<\/p>\n<p>\u00a0\u00a0<small>Ron Federman \u2014 Odigos<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a 
href=\"http:\/\/highscalability.com\/blog\/2023\/8\/16\/the-swedbank-outage-shows-that-change-controls-dont-work.html\" target=\"_blank\" rel=\"noopener\">The Swedbank Outage shows that Change Controls don\u2019t\u00a0work<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Change is not the problem. It\u2019s unaddressed risk.<\/p>\n<p>\u00a0\u00a0<small>Bruce Johnston \u2014 High Scalability<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/ayende.com\/blog\/200289-B\/production-postmortem-the-spawn-of-denial-of-service?Key=afdb85dc-3822-4528-b03b-456fceeb4409\" target=\"_blank\" rel=\"noopener\">Production Postmortem: The Spawn of Denial of Service<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>A shell script with a loop running a DB client can fill up your ephemeral ports in a hurry.<\/p>\n<p>\u00a0\u00a0<small>Oren Eini \u2014 RavenDB<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/hart-michael.medium.com\/writing-code-is-the-same-thing-as-writing-prose-ba8d04ed6917\" target=\"_blank\" rel=\"noopener\">Writing Code is the Same Thing as Writing Prose<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>When you get right down to it, it\u2019s all human communication, even assembly code. It\u2019s human factors all the way down.<\/p>\n<p>\u00a0\u00a0<small>Michael Hart<\/small><\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<p>SRE WEEKLY<\/p>","protected":false},"excerpt":{"rendered":"<p>View on sreweekly.com A message from our sponsor, FireHydrant: Looking to cozy up with a good read this week? Check out \u201cYour guide to better status pages.\u201d It\u2019s a mini masterclass on how to better communicate on your status pages. 
https:\/\/firehydrant.com\/blog\/your-guide-to-better-incident-status-pages\/ Rule of 10x per 9 For every 9 you add to SLO, you\u2019re making&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2023\/12\/25\/sre-weekly-issue-404\/\">Continue reading <span class=\"screen-reader-text\">SRE Weekly Issue #404<\/span><\/a><\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[8],"tags":[],"class_list":["post-807","post","type-post","status-publish","format-standard","hentry","category-sre","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":498,"url":"https:\/\/fde.cat\/index.php\/2021\/11\/01\/sre-weekly-issue-294\/","url_meta":{"origin":807,"position":0},"title":"SRE Weekly Issue #294","date":"November 1, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo: https:\/\/rootly.com\/?utm_source=sreweekly Articles Five Steps To Reduce SRE Toil\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":797,"url":"https:\/\/fde.cat\/index.php\/2023\/11\/27\/sre-weekly-issue-400\/","url_meta":{"origin":807,"position":1},"title":"SRE Weekly Issue #400","date":"November 27, 2023","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, FireHydrant: How is FireHydrant building its alerting tool, Signals, to be robust, lightning-fast, and configurable to how YOU work? In this edition, of their Captain\u2019s Log, they dive into CEL and how they\u2019re using it to handle routing and logic. 
https:\/\/firehydrant.com\/blog\/captains-log-how-were-leveraging-cel\/ A\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":725,"url":"https:\/\/fde.cat\/index.php\/2023\/06\/19\/sre-weekly-issue-377\/","url_meta":{"origin":807,"position":2},"title":"SRE Weekly Issue #377","date":"June 19, 2023","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Curious how companies like Figma, Tripadvisor, and 100s of others leverage Rootly to manage incidents in Slack and unlock instant best practices? Check out this lightning demo: https:\/\/www.loom.com\/share\/051c4be0425a436e888dc0c3690855ad Articles Why did AWS Support fail with US-EAST-1 again? AWS had a major\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":627,"url":"https:\/\/fde.cat\/index.php\/2022\/09\/05\/sre-weekly-issue-337\/","url_meta":{"origin":807,"position":3},"title":"SRE Weekly Issue #337","date":"September 5, 2022","format":false,"excerpt":"View on sreweekly.com Thanks for all the vacation well-wishes! It was really great and relaxing. Take vacations, it\u2019s important for reliability! While I was out, I shipped the past two issues with content prepared in advance, and without the Outages section. This gave me a chance to really think hard\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":603,"url":"https:\/\/fde.cat\/index.php\/2022\/07\/04\/sre-weekly-issue-329\/","url_meta":{"origin":807,"position":4},"title":"SRE Weekly Issue #329","date":"July 4, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. 
Book a demo (+ get a snazzy Rootly lego set): https:\/\/rootly.com\/demo\/\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":607,"url":"https:\/\/fde.cat\/index.php\/2022\/07\/18\/sre-weekly-issue-330\/","url_meta":{"origin":807,"position":5},"title":"SRE Weekly Issue #330","date":"July 18, 2022","format":false,"excerpt":"View on sreweekly.com Thanks for all the well-wishes as I took a sick day last week. I\u2019m feeling much better! A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/807","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=807"}],"version-history":[{"count":0,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/807\/revisions"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=807"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=807"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=807"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}