{"id":737,"date":"2023-07-23T14:37:18","date_gmt":"2023-07-23T14:37:18","guid":{"rendered":"https:\/\/fde.cat\/index.php\/2023\/07\/23\/sre-weekly-issue-382\/"},"modified":"2023-07-23T14:37:18","modified_gmt":"2023-07-23T14:37:18","slug":"sre-weekly-issue-382","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2023\/07\/23\/sre-weekly-issue-382\/","title":{"rendered":"SRE Weekly Issue #382"},"content":{"rendered":"<p><a href=\"https:\/\/sreweekly.com\/sre-weekly-issue-382\/\" title=\"Permalink to SRE Weekly Issue #382\" class=\"email_only\">View on sreweekly.com<\/a><\/p>\n<div class=\"sreweekly-sponsor-message\">\n<h2>A message from our sponsor, <a href=\"https:\/\/rootly.com\/demo\/?utm_source=sreweekly\">Rootly<\/a>:<\/h2>\n<p>Eliminate the anxiety around declaring an incident for nebulous problems by introducing a triage phase into your incident management process. Our latest blog posts dives into why the triage phase is so important, and how you can automate yours with Rootly.<\/p>\n<p>Read more on the Rootly blog:<br \/>\n<a href=\"https:\/\/rootly.com\/blog\/improve-visibility-and-capture-more-data-with-triage-incidents\">https:\/\/rootly.com\/blog\/improve-visibility-and-capture-more-data-with-triage-incidents<\/a><\/p>\n<\/div>\n<h2>Articles<\/h2>\n<div class=\"wp-block-group\">\n<div class=\"wp-block-group__inner-container\">\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/redpanda.com\/blog\/solve-out-of-memory-killer-events\" target=\"_blank\" rel=\"noopener\">Solving challenges caused by Out Of Memory (OOM) Killer in Linux<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>The Linux OOM killer can already be a bugbear, and things only get more complicated when you add containers to the mix.<\/p>\n<p>\u00a0\u00a0<small>Rafa\u0142 Korepta \u2014 RedPanda<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/firehydrant.com\/blog\/align-platform-and-product-engineering-teams-over-incidents\/\" target=\"_blank\" rel=\"noopener\">Align platform and product engineering teams over incidents<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This post explores how to align platform and product engineering teams by implementing business value proxy metrics and using incidents to inform them.<\/p>\n<p>The same metrics that we use to measure other initiatives against business priorities may be able to show us whether our incident response process is effective.<\/p>\n<p>\u00a0\u00a0<small>Gonzalo Maldonado \u2014 FireHydrant<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/medium.com\/@diogo.souzasre\/devops-vs-sre-is-it-a-party-40d1a305ae4b\" target=\"_blank\" rel=\"noopener\">DevOps vs SRE: Is it a party?<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Here\u2019s another take on devops vs SRE, using a metaphor of organizing a party.<\/p>\n<p>\u00a0\u00a0<small>Diogo Souza<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/thenewstack.io\/embrace-ai-acceleration-by-investing-in-reliability\/\" target=\"_blank\" rel=\"noopener\">Embrace AI Acceleration by Investing in Reliability<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>how do you balance taking advantage of the acceleration and innovation of AI while not compromising reliability and losing users?<\/p>\n<p>\u00a0\u00a0<small>Jim Gochee \u2014 The New Stack<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"http:\/\/businessnewsthisweek.com\/business\/human-error-is-the-scapegoat-for-systemic-and-organizational-failures\/\" target=\"_blank\" rel=\"noopener\">\u201cHuman Error\u201d is the Scapegoat for Systemic and Organizational Failures<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>My favorite part is the bit about the risks of automation and keeping humans in the loop.<\/p>\n<p>\u00a0\u00a0<small>Dr. Mica Endsley \u2014 Business News This Week<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/dzone.com\/articles\/revolutionizing-infrastructure-management-the-powe\" target=\"_blank\" rel=\"noopener\">Revolutionizing Infrastructure Management: The Power of Feature Flags in IaC<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>It\u2019s about reliability: IaC changes carry just as much risk to reliability as product code changes, if not more.  How can we bring feature flags to IaC?<\/p>\n<p>\u00a0\u00a0<small>Josephine E. Justin, Srikanth Murali, and Norton Stanley S A \u2014 DZone<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/certomodo.substack.com\/p\/on-call-stories-flying-blind\" target=\"_blank\" rel=\"noopener\">On-Call Stories: Flying Blind<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Oh, the tangled web we weave when we send automated emails.<\/p>\n<p>\u00a0\u00a0<small>Amin Astaneh \u2014 Certo Modo<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"http:\/\/highscalability.com\/blog\/2023\/7\/16\/lessons-learned-running-presto-at-meta-scale.html\" target=\"_blank\" rel=\"noopener\">Lessons Learned Running Presto at Meta\u00a0Scale<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Here are four things we learned while scaling up Presto to Meta scale, and some advice if you\u2019re interested in running your own queries at scale.<\/p>\n<p>\u00a0\u00a0<small>High Scalability<\/small><\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<p>SRE WEEKLY<\/p>","protected":false},"excerpt":{"rendered":"<p>View on sreweekly.com A message from our sponsor, Rootly: Eliminate the anxiety around declaring an incident for nebulous problems by introducing a triage phase into your incident management process. Our latest blog posts dives into why the triage phase is so important, and how you can automate yours with Rootly. Read more on the Rootly&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2023\/07\/23\/sre-weekly-issue-382\/\">Continue reading <span class=\"screen-reader-text\">SRE Weekly Issue #382<\/span><\/a><\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[8],"tags":[],"class_list":["post-737","post","type-post","status-publish","format-standard","hentry","category-sre","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":739,"url":"https:\/\/fde.cat\/index.php\/2023\/07\/30\/sre-weekly-issue-383\/","url_meta":{"origin":737,"position":0},"title":"SRE Weekly Issue #383","date":"July 30, 2023","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Eliminate the anxiety around declaring an incident for nebulous problems by introducing a triage phase into your incident management process. Our latest blog posts dives into why the triage phase is so important, and how you can automate yours with Rootly.\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":543,"url":"https:\/\/fde.cat\/index.php\/2022\/02\/21\/sre-weekly-issue-310\/","url_meta":{"origin":737,"position":1},"title":"SRE Weekly Issue #310","date":"February 21, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":746,"url":"https:\/\/fde.cat\/index.php\/2023\/08\/14\/sre-weekly-issue-385\/","url_meta":{"origin":737,"position":2},"title":"SRE Weekly Issue #385","date":"August 14, 2023","format":false,"excerpt":"View on sreweekly.com Many apologies to Matt Cooper at GitHub, who is the actual author of the article Scaling Merge-ort Across GitHub from last week. Sorry for the mis-credit, Matt! A message from our sponsor, Rootly: When incidents impact your customers, failing to communicate with them effectively can erode trust\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":535,"url":"https:\/\/fde.cat\/index.php\/2022\/01\/24\/sre-weekly-issue-306\/","url_meta":{"origin":737,"position":3},"title":"SRE Weekly Issue #306","date":"January 24, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":579,"url":"https:\/\/fde.cat\/index.php\/2022\/05\/30\/sre-weekly-issue-324\/","url_meta":{"origin":737,"position":4},"title":"SRE Weekly Issue #324","date":"May 30, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly lego set): https:\/\/rootly.com\/demo\/\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":546,"url":"https:\/\/fde.cat\/index.php\/2022\/03\/07\/sre-weekly-issue-312\/","url_meta":{"origin":737,"position":5},"title":"SRE Weekly Issue #312","date":"March 7, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/737","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=737"}],"version-history":[{"count":0,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/737\/revisions"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=737"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=737"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=737"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}