{"id":343,"date":"2021-08-31T14:39:28","date_gmt":"2021-08-31T14:39:28","guid":{"rendered":"https:\/\/fde.cat\/?p=343"},"modified":"2021-08-31T14:39:28","modified_gmt":"2021-08-31T14:39:28","slug":"sre-weekly-issue-282","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-282\/","title":{"rendered":"SRE Weekly Issue #282"},"content":{"rendered":"<p><a href=\"https:\/\/sreweekly.com\/sre-weekly-issue-282\/\" title=\"Permalink to SRE Weekly Issue #282\" class=\"email_only\">View on sreweekly.com<\/a><\/p>\n<div class=\"sreweekly-sponsor-message\">\n<h2>A message from our sponsor, StackHawk:<\/h2>\n<p>ICYMI ZAP Creator and Project Lead Simon Bennetts recently unveiled ZAP\u2019s new automation framework. Watch the session and see how it works:<br \/>\n<a href=\"https:\/\/sthwk.com\/Automation-Framework\">https:\/\/sthwk.com\/Automation-Framework<\/a><\/p>\n<\/div>\n<h2>Articles<\/h2>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"http:\/\/www.brendangregg.com\/blog\/2019-08-19\/bpftrace.html\">A thorough introduction to bpftrace<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>I really need to learn bpftrace, and this article is a great place to start.<\/p>\n<p>Brendan Gregg<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/incident.io\/blog\/incidents-are-for-everyone\">Incidents are for everyone<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>If we expand our definition of \u201cincident\u201d beyond traditional engineering problems, we increase our opportunity for learning.<\/p>\n<p>Stephen Whitworth \u2014 incident.io<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/devops.com\/where-do-sres-go-from-here\/\">Where Do SREs Go From Here?<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This is an interview with a director at Catchpoint about their 2021 SRE 
Report. They discuss two results from the survey: folks report a 15% decrease in toil and slow adoption of AIOps.<\/p>\n<p>Charlene O\u2019Hanlon \u2014 devops.com<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/dev.to\/devteam\/incident-retro-failing-comment-creation-erroneous-push-notifications-55dj\">Incident Retro: Failing Comment Creation + Erroneous Push Notifications <\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>A recurring theme in this story is that the incident was when folks learned how the push notifications work.<\/p>\n<p>Molly Struve \u2014 DEV<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.reddit.com\/r\/sre\/comments\/oy76w1\/dev_focused_sres_do_not_want_to_take_on\/\">r\/sre \u2013 Dev focused SREs do not want to take on operational tasks<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>In this reddit thread, a company hired some developers as SREs and then found that they didn\u2019t want to do operations work. Folks weigh in on why and what to do.<\/p>\n<p>u\/red_flock and others \u2014 reddit<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/blog.last9.io\/latency-slo\/\">Latency based SLO<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>How exactly do you want to phrase (and measure) an SLO about latency percentiles? Beware the subtle details.<\/p>\n<p>Piyush Verma \u2014 last9<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.blameless.com\/blog\/resilience-in-action-e9-vulnerability-compassion-and-post-incident-review\">Resilience in Action E9: Vulnerability, Compassion, and Post-Incident Reviews in the Emergency Room with Dr. 
Al\u2019ai Alvarez<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>I\u2019m definitely going to think on the great incident response and followup wisdom in this interview. My favorite:<\/p>\n<p>If I can change 1% to better that outcome, what is that 1%?<\/p>\n<p>Christina Tan \u2014 Blameless<\/p>\n<p><em>Full disclosure: Fastly, my employer, is mentioned.<\/em><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/surfingcomplexity.blog\/2021\/08\/05\/burned-by-let-it-burn\/\">Burned by \u2018let it burn\u2019<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Root cause: guessed wrong in the moment<\/p>\n<p>Lorin Hochstein<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/rootly.io\/blog\/incident-management-goes-to-the-olympics\">Incident Management Goes to the Olympics<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Here\u2019s a run-down of some IT mishaps from Olympic games past and present.<\/p>\n<p>Quentin Rousseau \u2014 Rootly<\/p>\n<\/div>\n<\/div>\n<h2>Outages<\/h2>\n<p><a href=\"https:\/\/twitter.com\/ZPostFacto\/status\/1423113896076840960\">Valve<\/a><br \/>\n<a href=\"https:\/\/www.cbsnews.com\/news\/spirit-airlines-flights-cancellations-delays-latest-2021-08-03\/\">Spirit Airlines<\/a><br \/>\n<a href=\"https:\/\/www.searchenginejournal.com\/wpx-hosting-outage\/415200\/\">WPX<\/a><br \/>\n<a href=\"https:\/\/piunikaweb.com\/2021\/08\/03\/is-instagram-down-and-not-working-again\/\">Instagram<\/a><br \/>\n<a href=\"https:\/\/www.the-sun.com\/tech\/3404412\/twitter-down-site-not-working\/\">Twitter<\/a><br \/>\n<a href=\"https:\/\/www.thesun.co.uk\/tech\/15784955\/onlyfans-down-models-complain-of-outage\/\">OnlyFans<\/a><br \/>\nSRE WEEKLY<\/p>\n","protected":false},"excerpt":{"rendered":"<p>View on sreweekly.com A message from our sponsor, StackHawk: ICYMI ZAP Creator and Project Lead Simon Bennetts recently unveiled 
ZAP\u2019s new automation framework. Watch the session and see how it works: https:\/\/sthwk.com\/Automation-Framework Articles A thorough introduction to bpftrace I really need to learn bpftrace, and this article is a great place to start. Brendan Gregg&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-282\/\">Continue reading <span class=\"screen-reader-text\">SRE Weekly Issue #282<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[8],"tags":[],"class_list":["post-343","post","type-post","status-publish","format-standard","hentry","category-sre","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":333,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-279\/","url_meta":{"origin":343,"position":0},"title":"SRE Weekly Issue #279","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: On July 28, ZAP Creator Simon Bennetts is giving a first look at ZAP\u2019s new automation framework. Grab your spot: https:\/\/sthwk.com\/ZAP-Automation Articles Managing the Risk of Cascading Failure This is a presentation by Laura Nolan (with text transcript) all about cascading\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":467,"url":"https:\/\/fde.cat\/index.php\/2021\/09\/20\/sre-weekly-issue-288\/","url_meta":{"origin":343,"position":1},"title":"SRE Weekly Issue #288","date":"September 20, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Want to see what\u2019s new with automated security tooling? Tune in on September 30 to see how StackHawk and Semgrep are making it possible to embed security testing in CI\/CD. 
https:\/\/sthwk.com\/whats-new-webinar Articles Tammy Bryant Butow on SRE Apprentices Faced with a\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":525,"url":"https:\/\/fde.cat\/index.php\/2021\/12\/28\/sre-netflix-at-srecon\/","url_meta":{"origin":343,"position":2},"title":"SRE Netflix at SRECon","date":"December 28, 2021","format":"video","excerpt":"190 Countries and 5 CORE SREs by Jonah Horowitz How does Netflix scale SRE? How do we manage over 70 million customers around the world without a 24\/7 operations center? With tens of thousands of Linux instances in a distributed system architecture, and thousands of daily production changes, it's an\u2026","rel":"","context":"In &quot;External&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":303,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-268\/","url_meta":{"origin":343,"position":3},"title":"SRE Weekly Issue #268","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Join StackHawk Tuesday May 4 at 9 am PT for a hands-on technical workshop! By the end of the session, you will have three types of security testing running in your GitHub pipeline. Register: http:\/\/sthwk.com\/technical-workshop Articles Manageable On-Call for Companies without\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":577,"url":"https:\/\/fde.cat\/index.php\/2022\/05\/16\/sre-weekly-issue-322\/","url_meta":{"origin":343,"position":4},"title":"SRE Weekly Issue #322","date":"May 16, 2022","format":false,"excerpt":"View on sreweekly.com Bit of a short issue this week. This morning, I stepped on my phone, crushing it mightily beneath my bootheel. Unfortunately a lot of my automation for reviewing articles is on there\u2026 thank goodness I have functioning backups. 
A message from our sponsor, Rootly: Manage incidents directly\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":832,"url":"https:\/\/fde.cat\/index.php\/2024\/03\/04\/sre-weekly-issue-414\/","url_meta":{"origin":343,"position":5},"title":"SRE Weekly Issue #414","date":"March 4, 2024","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, FireHydrant: 91% of engineering leaders say they want a better alerting tool. The other 9% couldn\u2019t take the survey on their Blackberry. Meet Signals: a new standard in alerting and on call, now available. https:\/\/firehydrant.com\/blog\/alerting-and-on-call-scheduling-for-how-you-actually-work\/ 2024 VOID Report This year\u2019s VOID Report\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/343","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=343"}],"version-history":[{"count":1,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/343\/revisions"}],"predecessor-version":[{"id":367,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/343\/revisions\/367"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=343"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=343"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=343"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","te
mplated":true}]}}