{"id":775,"date":"2023-10-23T01:13:33","date_gmt":"2023-10-23T01:13:33","guid":{"rendered":"https:\/\/fde.cat\/index.php\/2023\/10\/23\/sre-weekly-issue-395\/"},"modified":"2023-10-23T01:13:33","modified_gmt":"2023-10-23T01:13:33","slug":"sre-weekly-issue-395","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2023\/10\/23\/sre-weekly-issue-395\/","title":{"rendered":"SRE Weekly Issue #395"},"content":{"rendered":"<p><a href=\"https:\/\/sreweekly.com\/sre-weekly-issue-395\/\" title=\"Permalink to SRE Weekly Issue #395\" class=\"email_only\">View on sreweekly.com<\/a><\/p>\n<div class=\"sreweekly-sponsor-message\">\n<h2>A message from our sponsor, <a href=\"https:\/\/firehydrant.com\/\">FireHydrant<\/a>:<\/h2>\n<p>Incident management platform FireHydrant is combining alerting and incident response in one ring-to-retro tool. Sign up for the early access waitlist and be the first to experience the power of alerting + incident response in one platform at last.<br \/>\n<a href=\"https:\/\/firehydrant.com\/signals\/\">https:\/\/firehydrant.com\/signals\/<\/a><\/p>\n<\/div>\n<div class=\"wp-block-group\">\n<div class=\"wp-block-group__inner-container\">\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/robertovitillo.com\/what-every-developer-should-know-about-database-consistency\/\" target=\"_blank\" rel=\"noopener\">What every developer should know about database consistency<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This article gives an overview of database consistency models and introduces the PACELC Theorem.<\/p>\n<p>\u00a0\u00a0<small>Roberto Vitillo<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.codereliant.io\/what-is-a-memory-leak\/\" target=\"_blank\" rel=\"noopener\">What is a Memory leak? Causes | Detection | Tools | Golang<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>A primer on memory and resource leaks, including some lesser-known causes.<\/p>\n<p>\u00a0\u00a0<small>Code Reliant<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.honeycomb.io\/rescue-struggling-pods-from-scratch\" target=\"_blank\" rel=\"noopener\">Rescue Struggling Pods from Scratch<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>How can you troubleshoot a broken pod when it\u2019s built FROM scratch and you can\u2019t even run a shell in it?<\/p>\n<p>\u00a0\u00a0<small>Mike Terhar<\/small><br \/>\u00a0\u00a0<small><em>Full disclosure: Honeycomb is my employer.<\/em><\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.gremlin.com\/\/blog\/five-mindset-shifts-for-effective-reliability-programs\/\" target=\"_blank\" rel=\"noopener\">Five mindset shifts for effective reliability programs<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This article explains why reliability isn\u2019t just a one-off project that you can bolt on and move on.<\/p>\n<p>\u00a0\u00a0<small>Gavin Cahill \u2014 Gremlin<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/doordash.engineering\/2023\/08\/15\/bpfagent-ebpf-for-monitoring-at-doordash\/\" target=\"_blank\" rel=\"noopener\">BPFAgent: eBPF for Monitoring at DoorDash<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>DoorDash wanted consistent observability across their infrastructure that didn\u2019t depend on instrumenting each application.  To solve this, they developed BPFAgent, and this article explains how.<\/p>\n<p>\u00a0\u00a0<small>Patrick Rogers \u2014 DoorDash<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.techtarget.com\/searchnetworking\/definition\/mean-time-to-innocence\" target=\"_blank\" rel=\"noopener\">What is Mean Time to Innocence?<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Mean time to innocence is the average elapsed time between when a system problem is detected and any given team\u2019s ability to say the team or part of its system is not the root cause of the problem.<\/p>\n<p>This article, of course, is about <em>not<\/em> having a culture like that.<\/p>\n<p>\u00a0\u00a0<small>John Burke \u2014 TechTarget<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.pulumi.com\/blog\/post-mortem-2023-10-06\/\" target=\"_blank\" rel=\"noopener\">Details of the Pulumi Outage on October 6, 2023<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>It was the DB \u2014 more specifically, it was a DB migration with unintended locking.<\/p>\n<p>\u00a0\u00a0<small>Casey Huang \u2014 Pulumi<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/status.cloud.google.com\/incidents\/U39RSGjaANJXtjHpRkdq\" target=\"_blank\" rel=\"noopener\">Google Cloud Networking Incident Report (2023-10-05)<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>The incident stemmed from a control plane change that worked in some regions but caused OOMs in others.<\/p>\n<p>\u00a0\u00a0<small>Google<\/small><\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<p>SRE WEEKLY<\/p>","protected":false},"excerpt":{"rendered":"<p>View on sreweekly.com A message from our sponsor, FireHydrant: Incident management platform FireHydrant is combining alerting and incident response in one ring-to-retro tool. Sign up for the early access waitlist and be the first to experience the power of alerting + incident response in one platform at last. https:\/\/firehydrant.com\/signals\/ What every developer should know about&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2023\/10\/23\/sre-weekly-issue-395\/\">Continue reading <span class=\"screen-reader-text\">SRE Weekly Issue #395<\/span><\/a><\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[8],"tags":[],"class_list":["post-775","post","type-post","status-publish","format-standard","hentry","category-sre","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":781,"url":"https:\/\/fde.cat\/index.php\/2023\/11\/06\/sre-weekly-issue-397\/","url_meta":{"origin":775,"position":0},"title":"SRE Weekly Issue #397","date":"November 6, 2023","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, FireHydrant: Incident management platform FireHydrant is combining alerting and incident response in one ring-to-retro tool. Sign up for the early access waitlist and be the first to experience the power of alerting + incident response in one platform at last. https:\/\/firehydrant.com\/signals\/ Modern\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":786,"url":"https:\/\/fde.cat\/index.php\/2023\/11\/13\/sre-weekly-issue-398\/","url_meta":{"origin":775,"position":1},"title":"SRE Weekly Issue #398","date":"November 13, 2023","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, FireHydrant: \u201cChange is the essential process of all existence.\u201d \u2013 Spock It\u2019s time for alerting to evolve. Get a first look at how incident management platform FireHydrant is architecting Signals, its native alerting tool, for resilience in the Signals Captain\u2019s Log. https:\/\/firehydrant.com\/blog\/captains-log-a-first-look-at-our-architecture-for-signals\/\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":797,"url":"https:\/\/fde.cat\/index.php\/2023\/11\/27\/sre-weekly-issue-400\/","url_meta":{"origin":775,"position":2},"title":"SRE Weekly Issue #400","date":"November 27, 2023","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, FireHydrant: How is FireHydrant building its alerting tool, Signals, to be robust, lightning-fast, and configurable to how YOU work? In this edition, of their Captain\u2019s Log, they dive into CEL and how they\u2019re using it to handle routing and logic. https:\/\/firehydrant.com\/blog\/captains-log-how-were-leveraging-cel\/ A\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":874,"url":"https:\/\/fde.cat\/index.php\/2024\/06\/10\/sre-weekly-issue-428\/","url_meta":{"origin":775,"position":3},"title":"SRE Weekly Issue #428","date":"June 10, 2024","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, FireHydrant: We\u2019ve gone all out on our new integration with Microsoft Teams. If you\u2019re a MS Teams user, FireHydrant now supports the most comprehensive integration for incident management. Run the entire IM process without ever leaving the chat. https:\/\/firehydrant.com\/blog\/introducing-a-brand-new-microsoft-teams-integration\/ The Reverse Red\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":862,"url":"https:\/\/fde.cat\/index.php\/2024\/05\/06\/sre-weekly-issue-423\/","url_meta":{"origin":775,"position":4},"title":"SRE Weekly Issue #423","date":"May 6, 2024","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, FireHydrant: FireHydrant is now AI-powered for faster, smarter incidents! Power up your incidents with auto-generated real-time summaries, retrospectives, and status page updates. https:\/\/firehydrant.com\/blog\/ai-for-incident-management-is-here\/ How to Fight Alert Fatigue with Synthetic Monitoring This one\u2019s full of great advice about making sure alerts are\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":819,"url":"https:\/\/fde.cat\/index.php\/2024\/02\/05\/sre-weekly-issue-410\/","url_meta":{"origin":775,"position":5},"title":"SRE Weekly Issue #410","date":"February 5, 2024","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, FireHydrant: How many seats are you paying for in your legacy alerting tool that rarely get paged? With Signals\u2019 bucket pricing, you only pay for what you use. Join the beta for a better tool at a better price. https:\/\/firehydrant.com\/blog\/signals-beta-live\/ Staying in\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/775","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=775"}],"version-history":[{"count":0,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/775\/revisions"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=775"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=775"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=775"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}