{"id":343,"date":"2021-08-31T14:39:28","date_gmt":"2021-08-31T14:39:28","guid":{"rendered":"https:\/\/fde.cat\/?p=343"},"modified":"2021-08-31T14:39:28","modified_gmt":"2021-08-31T14:39:28","slug":"sre-weekly-issue-282","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-282\/","title":{"rendered":"SRE Weekly Issue #282"},"content":{"rendered":"<p><a href=\"https:\/\/sreweekly.com\/sre-weekly-issue-282\/\" title=\"Permalink to SRE Weekly Issue #282\" class=\"email_only\">View on sreweekly.com<\/a><\/p>\n<div class=\"sreweekly-sponsor-message\">\n<h2>A message from our sponsor, StackHawk:<\/h2>\n<p>ICYMI ZAP Creator and Project Lead Simon Bennetts recently unveiled ZAP\u2019s new automation framework. Watch the session and see how it works:<br \/>\n<a href=\"https:\/\/sthwk.com\/Automation-Framework\">https:\/\/sthwk.com\/Automation-Framework<\/a><\/p>\n<\/div>\n<h2>Articles<\/h2>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"http:\/\/www.brendangregg.com\/blog\/2019-08-19\/bpftrace.html\">A thorough introduction to bpftrace<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>I really need to learn bpftrace, and this article is a great place to start.<\/p>\n<p>Brendan Gregg<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/incident.io\/blog\/incidents-are-for-everyone\">Incidents are for everyone<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>If we expand our definition of \u201cincident\u201d beyond traditional engineering problems, we increase our opportunity for learning.<\/p>\n<p>Stephen Whitworth \u2014 incident.io<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/devops.com\/where-do-sres-go-from-here\/\">Where Do SREs Go From Here?<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This is an interview with a director at Catchpoint about their 2021 SRE Report. They discuss two results from the survey: folks report a 15% decrease in toil and slow adoption of AIOps.<\/p>\n<p>Charlene O\u2019Hanlon \u2014 devops.com<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/dev.to\/devteam\/incident-retro-failing-comment-creation-erroneous-push-notifications-55dj\">Incident Retro: Failing Comment Creation + Erroneous Push Notifications <\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>A recurring theme in this story is that the incident was when folks learned how the push notifications work.<\/p>\n<p>Molly Struve \u2014 DEV<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.reddit.com\/r\/sre\/comments\/oy76w1\/dev_focused_sres_do_not_want_to_take_on\/\">r\/sre \u2013 Dev focused SREs do not want to take on operational tasks<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>In this reddit thread, a company hired some developers as SREs and then found that they didn\u2019t want to do operations work. Folks weigh on why and what to do.<\/p>\n<p>u\/red_flock and others \u2014 reddit<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/blog.last9.io\/latency-slo\/\">Latency based SLO<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>How exactly do you want to phrase (and measure) an SLO about latency percentiles? Beware the subtle details.<\/p>\n<p>Piyush Verma \u2014 last9<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.blameless.com\/blog\/resilience-in-action-e9-vulnerability-compassion-and-post-incident-review\">Resilience in Action E9: Vulnerability, Compassion, and Post-Incident Reviews in the Emergency Room with Dr. Al\u2019ai Alvarez<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>I\u2019m definitely going to think on the great incident response and followup wisdom in this interview. My favorite:<\/p>\n<p>If I can change 1% to better that outcome, what is that 1%?<\/p>\n<p>Christina Tan \u2014 Blameless<\/p>\n<p><em>Full disclosure: Fastly, my employer, is mentioned.<\/em><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/surfingcomplexity.blog\/2021\/08\/05\/burned-by-let-it-burn\/\">Burned by \u2018let it burn\u2019<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Root cause: guessed wrong in the moment<\/p>\n<p>Lorin Hochstein<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/rootly.io\/blog\/incident-management-goes-to-the-olympics\">Incident Management Goes to the Olympics<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Here\u2019s a run-down of some IT mishaps from Olympic games past and present.<\/p>\n<p>Quentin Rousseau \u2014 Rootly<\/p>\n<\/div>\n<\/div>\n<h2>Outages<\/h2>\n<p><a href=\"https:\/\/twitter.com\/ZPostFacto\/status\/1423113896076840960\">Valve<\/a><br \/>\n<a href=\"https:\/\/www.cbsnews.com\/news\/spirit-airlines-flights-cancellations-delays-latest-2021-08-03\/\">Spirit Airlines<\/a><br \/>\n<a href=\"https:\/\/www.searchenginejournal.com\/wpx-hosting-outage\/415200\/\">WPX<\/a><br \/>\n<a href=\"https:\/\/piunikaweb.com\/2021\/08\/03\/is-instagram-down-and-not-working-again\/\">Instagram<\/a><br \/>\n<a href=\"https:\/\/www.the-sun.com\/tech\/3404412\/twitter-down-site-not-working\/\">Twitter<\/a><br \/>\n<a href=\"https:\/\/www.thesun.co.uk\/tech\/15784955\/onlyfans-down-models-complain-of-outage\/\">OnlyFans<\/a><br \/>\nSRE WEEKLY<\/p>\n","protected":false},"excerpt":{"rendered":"<p>View on sreweekly.com A message from our sponsor, StackHawk: ICYMI ZAP Creator and Project Lead Simon Bennetts recently unveiled ZAP\u2019s new automation framework. Watch the session and see how it works: https:\/\/sthwk.com\/Automation-Framework Articles A thorough introduction to bpftrace I really need to learn bpftrace, and this article is a great place to start. Brendan Gregg&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-282\/\">Continue reading <span class=\"screen-reader-text\">SRE Weekly Issue #282<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[8],"tags":[],"class_list":["post-343","post","type-post","status-publish","format-standard","hentry","category-sre","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":333,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-279\/","url_meta":{"origin":343,"position":0},"title":"SRE Weekly Issue #279","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: On July 28, ZAP Creator Simon Bennetts is giving a first look at ZAP\u2019s new automation framework. Grab your spot: https:\/\/sthwk.com\/ZAP-Automation Articles Managing the Risk of Cascading Failure This is a presentation by Laura Nolan (with text transcript) all about cascading\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":525,"url":"https:\/\/fde.cat\/index.php\/2021\/12\/28\/sre-netflix-at-srecon\/","url_meta":{"origin":343,"position":1},"title":"SRE Netflix at SRECon","date":"December 28, 2021","format":"video","excerpt":"190 Countries and 5 CORE SREs by Jonah Horowitz How does Netflix scale SRE? How do we manage over 70 million customers around the world without a 24\/7 operations center? With tens of thousands of Linux instances in a distributed system architecture, and thousands of daily production changes, it's an\u2026","rel":"","context":"In &quot;External&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":467,"url":"https:\/\/fde.cat\/index.php\/2021\/09\/20\/sre-weekly-issue-288\/","url_meta":{"origin":343,"position":2},"title":"SRE Weekly Issue #288","date":"September 20, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Want to see what\u2019s new with automated security tooling? Tune in on September 30 to see how StackHawk and Semgrep are making it possible to embed security testing in CI\/CD. https:\/\/sthwk.com\/whats-new-webinar Articles Tammy Bryant Butow on SRE Apprentices Faced with a\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":303,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-268\/","url_meta":{"origin":343,"position":3},"title":"SRE Weekly Issue #268","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Join StackHawk Tuesday May 4 at 9 am PT for a hands-on technical workshop! By the end of the session, you will have three types of security testing running in your GitHub pipeline. Register: http:\/\/sthwk.com\/technical-workshop Articles Manageable On-Call for Companies without\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":650,"url":"https:\/\/fde.cat\/index.php\/2022\/11\/14\/sre-weekly-issue-347\/","url_meta":{"origin":343,"position":4},"title":"SRE Weekly Issue #347","date":"November 14, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly\u00a0\ud83d\ude92. Rootly automates manual tasks like creating an incident channel, Jira ticket and Zoom rooms, inviting responders, creating statuspage updates, postmortem timelines and more. Want to see why companies like Canva and Grammarly love us?:\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":832,"url":"https:\/\/fde.cat\/index.php\/2024\/03\/04\/sre-weekly-issue-414\/","url_meta":{"origin":343,"position":5},"title":"SRE Weekly Issue #414","date":"March 4, 2024","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, FireHydrant: 91% of engineering leaders say they want a better alerting tool. The other 9% couldn\u2019t take the survey on their Blackberry. Meet Signals: a new standard in alerting and on call, now available. https:\/\/firehydrant.com\/blog\/alerting-and-on-call-scheduling-for-how-you-actually-work\/ 2024 VOID Report This year\u2019s VOID Report\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/343","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=343"}],"version-history":[{"count":1,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/343\/revisions"}],"predecessor-version":[{"id":367,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/343\/revisions\/367"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=343"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=343"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=343"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}