{"id":835,"date":"2024-03-11T01:32:37","date_gmt":"2024-03-11T01:32:37","guid":{"rendered":"https:\/\/fde.cat\/index.php\/2024\/03\/11\/sre-weekly-issue-415\/"},"modified":"2024-03-11T01:32:37","modified_gmt":"2024-03-11T01:32:37","slug":"sre-weekly-issue-415","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2024\/03\/11\/sre-weekly-issue-415\/","title":{"rendered":"SRE Weekly Issue #415"},"content":{"rendered":"<p><a href=\"https:\/\/sreweekly.com\/sre-weekly-issue-415\/\" title=\"Permalink to SRE Weekly Issue #415\" class=\"email_only\">View on sreweekly.com<\/a><\/p>\n<div class=\"sreweekly-sponsor-message\">\n<h2>A message from our sponsor, <a href=\"https:\/\/firehydrant.com\/\">FireHydrant<\/a>:<\/h2>\n<p>Join FireHydrant and talk shop with your DevOps peers on March 28! You\u2019ll gain a better understanding of what makes a fatigue-free on-call culture and how to implement practices to improve yours at this free, virtual roundtable.<br \/>\n<a href=\"https:\/\/app.livestorm.co\/firehydrant\/better-incidents-spring-bonfire-secrets-to-fatigue-free-on-call-in-2024\">https:\/\/app.livestorm.co\/firehydrant\/better-incidents-spring-bonfire-secrets-to-fatigue-free-on-call-in-2024<\/a><\/p>\n<\/div>\n<div class=\"wp-block-group is-layout-flow wp-block-group-is-layout-flow\">\n<div class=\"wp-block-group__inner-container\">\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/thenewstack.io\/the-wrong-way-to-use-dora-metrics\/\" target=\"_blank\" rel=\"noopener\">The Wrong Way to Use DORA Metrics<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>[\u2026] it must be said that the intent of these metrics was always to give an indicator of how well your team was delivering software, not a high-stakes metric that should be used, for example, to hire and fire team leads.<\/p>\n<p>\u00a0\u00a0<small>No\u010dnica Mellifera \u2014 The New Stack<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/blog.readyset.io\/investigating-and-optimizing-over-querying\/\" target=\"_blank\" rel=\"noopener\">Investigating and Optimizing Over-Querying<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>A primer on the problems with N+1 database queries and how this pattern can sneak into your code whether you realize it or not.<\/p>\n<p>\u00a0\u00a0<small>neda \u2014 ReadySet<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/bravenewgeek.com\/choosing-good-slis\/\" target=\"_blank\" rel=\"noopener\">Choosing Good SLIs<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>A great explainer on choosing the right SLIs, starting with the Golden Signals and branching out.<\/p>\n<p>\u00a0\u00a0<small>Tyler Treat<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/blog.alexewerlof.com\/p\/responsible-for-control\" target=\"_blank\" rel=\"noopener\">You should never be responsible for what you don\u2019t control<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>My favorite part about this is the \u201clatency budget\u201d question \u2014 which team\u2019s code gets to spend how much time doing its part to serve a request?<\/p>\n<p>\u00a0\u00a0<small>Alex Ewerl\u00f6f<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/blog.palark.com\/sre-troubleshooting-ceph-systemd-containerd\/\" target=\"_blank\" rel=\"noopener\">An\u00a0unexpected crash due to\u00a0unrelated software changes<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Changes in two programs <em>outside<\/em> the container made Ceph suddenly grind to a halt, as detailed in this troubleshooting story.<\/p>\n<p>\u00a0\u00a0<small>Vladimir Guryanov \u2014 Palark<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/medium.com\/production-care\/how-to-set-a-good-only-one-threshold-for-an-alert-ddc00c975821\" target=\"_blank\" rel=\"noopener\">How to set a good only one threshold for an alert?<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>The word \u201cone\u201d is the key here, as the author argues for getting rid of \u201cwarning\u201d alerts entirely in favor of using only \u201ccritical\u201d.<\/p>\n<p>\u00a0\u00a0<small>Gauthier Fran\u00e7ois<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/medium.com\/@matt_weingarten\/creating-an-oncall-handoff-bot-7ee3f67d1033\" target=\"_blank\" rel=\"noopener\">Creating An Oncall Handoff Bot<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>They wrote a Slack bot to summarize open PagerDuty incidents every day.<\/p>\n<p>\u00a0\u00a0<small>Matt Weingarten<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.honeycomb.io\/blog\/negotiating-priorities-incident-investigations\" target=\"_blank\" rel=\"noopener\">Negotiating Priorities Around Incident Investigations<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>The problems I\u2019ll explore in this blog\u2014from the SRE perspective\u2014are about time pressures (when to ship the investigation) and the type of report people expect.<\/p>\n<p>\u00a0\u00a0<small>Fred Hebert \u2014 Honeycomb<\/small><\/p>\n<p>\u00a0\u00a0<small><em>Full disclosure: Honeycomb is my employer.<\/em><\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/medium.com\/doctolib\/how-we-avoided-alarm-fatigue-syndrome-by-managing-reducing-the-alerting-noise-aac5c008d2e2\" target=\"_blank\" rel=\"noopener\">How we avoided alarm fatigue syndrome by managing\/reducing the alerting noise.<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>In order to reduce the noise, first they had to <em>define<\/em> noisy alerts and the KPIs they were looking to improve.<\/p>\n<p>\u00a0\u00a0<small>Gauthier Fran\u00e7ois \u2014 Doctolib<\/small><\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<p>SRE WEEKLY<\/p>","protected":false},"excerpt":{"rendered":"<p>View on sreweekly.com A message from our sponsor, FireHydrant: Join FireHydrant and talk shop with your DevOps peers on March 28! You\u2019ll gain a better understanding of what makes a fatigue-free on-call culture and how to implement practices to improve yours at this free, virtual roundtable. https:\/\/app.livestorm.co\/firehydrant\/better-incidents-spring-bonfire-secrets-to-fatigue-free-on-call-in-2024 The Wrong Way to Use DORA Metrics [\u2026]&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2024\/03\/11\/sre-weekly-issue-415\/\">Continue reading <span class=\"screen-reader-text\">SRE Weekly Issue #415<\/span><\/a><\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[8],"tags":[],"class_list":["post-835","post","type-post","status-publish","format-standard","hentry","category-sre","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":844,"url":"https:\/\/fde.cat\/index.php\/2024\/03\/25\/sre-weekly-issue-417\/","url_meta":{"origin":835,"position":0},"title":"SRE Weekly Issue #417","date":"March 25, 2024","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, FireHydrant: Join FireHydrant this Thursday for a conversation about on-call burnout and how to prevent it. Get a better understanding of what makes a fatigue-free on-call culture, including real-world examples from your incident management peers. No sales, just shop talk. https:\/\/app.livestorm.co\/firehydrant\/better-incidents-spring-bonfire-secrets-to-fatigue-free-on-call-in-2024 Harnessing\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":798,"url":"https:\/\/fde.cat\/index.php\/2023\/12\/04\/sre-weekly-issue-401\/","url_meta":{"origin":835,"position":1},"title":"SRE Weekly Issue #401","date":"December 4, 2023","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, FireHydrant: Join FireHydrant Dec.14 for a conversation about on-call culture and its effect on engineering organizations, featuring special guests from Outreach and Udemy. Gain a better understanding of what makes excellent on-call culture and how to implement practices to improve yours. https:\/\/app.livestorm.co\/firehydrant\/better-incidents-winter-bonfire-inside-on-call?type=detailed\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":847,"url":"https:\/\/fde.cat\/index.php\/2024\/04\/01\/sre-weekly-issue-418\/","url_meta":{"origin":835,"position":2},"title":"SRE Weekly Issue #418","date":"April 1, 2024","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, FireHydrant: FireHydrant is now AI-powered for faster, smarter incidents! Power up your incidents with auto-generated real-time summaries, retrospectives, and status page updates. https:\/\/firehydrant.com\/blog\/ai-for-incident-management-is-here\/ Redefining Observability The observability waters have been muddy for awhile, and this article does a great job of taking\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":829,"url":"https:\/\/fde.cat\/index.php\/2024\/02\/26\/sre-weekly-issue-413\/","url_meta":{"origin":835,"position":3},"title":"SRE Weekly Issue #413","date":"February 26, 2024","format":false,"excerpt":"View on sreweekly.com Sorry about the automation fail and resend! That definitely wasn\u2019t issue #1. A message from our sponsor, FireHydrant: Check out how global payments company Dock uses FireHydrant to streamline and consolidate their incident management stack and reduce what they call \u201cmean time to combat.\u201dhttps:\/\/firehydrant.com\/blog\/the-revolution-in-critical-incident-response-at-dock-with-firehydrant\/ The Domain of\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":855,"url":"https:\/\/fde.cat\/index.php\/2024\/04\/15\/sre-weekly-issue-420\/","url_meta":{"origin":835,"position":4},"title":"SRE Weekly Issue #420","date":"April 15, 2024","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, FireHydrant: FireHydrant is now AI-powered for faster, smarter incidents! Power up your incidents with auto-generated real-time summaries, retrospectives, and status page updates. https:\/\/firehydrant.com\/blog\/ai-for-incident-management-is-here\/ 1.0 Launch Retrospective The game Last Epoch launched in February, and they had a rocky start. This huge retrospective\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":823,"url":"https:\/\/fde.cat\/index.php\/2024\/02\/12\/sre-weekly-issue-411\/","url_meta":{"origin":835,"position":5},"title":"SRE Weekly Issue #411","date":"February 12, 2024","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, FireHydrant: \u201cTo be honest, when can we switch?\u201d The first impressions are in. Check out what people are saying after seeing Signals, the new standard in alerting and on-call from FireHydrant, for the first time. https:\/\/firehydrant.com\/signals\/ Shared On-Call Is Where the SRE\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/835","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=835"}],"version-history":[{"count":0,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/835\/revisions"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=835"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=835"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=835"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}