{"id":298,"date":"2021-08-31T14:40:03","date_gmt":"2021-08-31T14:40:03","guid":{"rendered":"https:\/\/fde.cat\/?p=298"},"modified":"2021-08-31T14:40:03","modified_gmt":"2021-08-31T14:40:03","slug":"sre-weekly-issue-266","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-266\/","title":{"rendered":"SRE Weekly Issue #266"},"content":{"rendered":"<p><a href=\"http:\/\/sreweekly.com\/sre-weekly-issue-266\/\" title=\"Permalink to SRE Weekly Issue #266\" class=\"email_only\">View on sreweekly.com<\/a><\/p>\n<div class=\"sreweekly-sponsor-message\">\n<h2>A message from our sponsor, StackHawk:<\/h2>\n<p>Are you a ZAP user looking to automate your security testing? Make sure to tune in to ZAPCon After Hours on Tuesday at 8 am PT to see how you can use Jenkins and Zest scripts to automate ZAP.<br \/>\n<a href=\"http:\/\/sthwk.com\/zapcon-ah\">http:\/\/sthwk.com\/zapcon-ah<\/a><\/p>\n<\/div>\n<h2>Articles<\/h2>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.theverge.com\/2021\/4\/9\/22375136\/airplane-flight-takes-off-heavier-than-expected-miss-ms-children\" target=\"_blank\" rel=\"noopener\">Airplane takes off a metric ton heavier than expected after computer error weighs adults as children<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This one was brought to my attention by Dr. Richard Cook, who also pointed me to the <a href=\"https:\/\/assets.publishing.service.gov.uk\/media\/604f423be90e077fdf88493f\/Boeing_737-8K5_G-TAWG_04-21.pdf\">AAIB incident report<\/a>.<\/p>\n<p>Dr. 
Cook went on to share these insights with me, which I\u2019ve copied here with permission:<\/p>\n<blockquote>\n<p>Note:<\/p>\n<ul>\n<li>the subtle interactions allowed the manual correction to be lost during the interval between recognizing the software problem and having the corrected software functionally \u2018catch\u2019 the Ms\/Miss title mixup;<\/li>\n<li>the incident is attributed to \u201ca simple flaw in the programming of the IT system\u201d rather than failure of the workarounds that were put in place after the problem was recognized;<\/li>\n<li>the report is careful to demonstrate that the flaws in the system made only a slight difference to the flight parameters;<\/li>\n<\/ul>\n<p>the report does not describe any IT process changes whatsoever!<\/p>\n<p>The report has the effect of making the incident appear to be an unfortunate series of occurrences rather than being emblematic of the way that these sorts of processes are vulnerable.<\/p>\n<\/blockquote>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.catchpoint.com\/press-releases\/catchpoint-announces-virtual-sre-community-event-on-june-10\" target=\"_blank\" rel=\"noopener\">Catchpoint Announces Virtual SRE Community Event on June 10<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Last year\u2019s SRE From Home event was awesome, and this year\u2019s iteration looks to be just as great.<\/p>\n<p><small>Catchpoint<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/mysteries.wizardzines.com\/connection-timeout.html\" target=\"_blank\" rel=\"noopener\">The Case of the Connection Timeout<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This is fun! 
Try your hand at troubleshooting a connection issue in this game-ified role-play scenario.<\/p>\n<p><strong>BONUS CONTENT<\/strong>: Read about the author\u2019s motivations, design decisions, and plans <a href=\"https:\/\/jvns.ca\/blog\/2021\/04\/16\/notes-on-debugging-puzzles\/\">here<\/a>.<\/p>\n<p><small>Julia Evans<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.informationweek.com\/strategic-cio\/enterprise-agility\/the-five-pillars-of-resilience-engineering\/a\/d-id\/1340623?_mc=rss_x_iwr_edt_aud_iw_x_x-rss-simple\" target=\"_blank\" rel=\"noopener\">The Five Pillars of Resilience Engineering<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Do we need to have some kind of Pillars Registry? Note, these are more like pillars of high availability than resilience engineering.<\/p>\n<p><small>Hector Aguilar \u2014 Okta<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/surfingcomplexity.blog\/2021\/04\/11\/incident-analysis-as-guerrilla-case-study-research\/\" target=\"_blank\" rel=\"noopener\">Incident analysis as guerrilla case study research<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>I love this idea that we\u2019re trying to get deep incident analysis done even though that may not be the actual goal of the organization.<\/p>\n<blockquote>\n<p>As LFI analysts, we\u2019re exploiting this desire for closure to justify spending time examining how work is really done inside of the system.<\/p>\n<\/blockquote>\n<p><small>Lorin Hochstein<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.blameless.com\/blog\/having-on-call-nightmares-runbooks-can-help-you-wake-up\" target=\"_blank\" rel=\"noopener\">Having On-call Nightmares? 
Runbooks can Help you Wake Up.<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This is well worth a read if only for the on-call scenario at the start. Yup, been there. We miss you, Harry.<\/p>\n<p><small>Harry Hull \u2014 Blameless<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.effx.com\/blog\/platform-engineering-vs-site-reliability-engineering\" target=\"_blank\" rel=\"noopener\">Platform engineering vs. site reliability engineering (SRE): here\u2019s what you need to know<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>What\u2019s the difference? Click through to learn about the distinction they\u2019re drawing.<\/p>\n<p><small>Amir Kazemi \u2014 effx<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/open.nytimes.com\/we-dont-get-bitter-we-get-better-b5d2783d5cd3\" target=\"_blank\" rel=\"noopener\">We Don\u2019t Get Bitter, We Get Better<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>The New York Times\u2019s Operations Engineering group developed an Operational Maturity Assessment and uses it to have collaborative conversations with teams about their systems.<\/p>\n<p>Author: The NYT Open Team \u2014 New York Times<\/p>\n<\/div>\n<\/div>\n<h2>Outages<\/h2>\n<ul class=\"sreweekly-outages\">\n<li><a href=\"https:\/\/www.google.com\/appsstatus#hl=en&amp;v=issue&amp;sid=4&amp;iid=a456acfa5bae03b9075bec69695292c6\">G-Suite<\/a>\n<ul class=\"sreweekly-outage\">\n<li class=\"sreweekly-outage\">Google posted this <a href=\"https:\/\/static.googleusercontent.com\/media\/www.google.com\/en\/\/appsstatus\/ir\/fxqtv40cmvuh2je.pdf\">\u201cMini Incident Report while full Incident Report is prepared.\u201d<\/a><\/li>\n<\/ul>\n<\/li>\n<li><a href=\"https:\/\/status.slack.com\/\/2021-04\/c111bf711d50f801\">Slack<\/a><\/li>\n<li><a 
href=\"https:\/\/status.docker.com\/pages\/incident\/533c6539221ae15e3f000031\/60787e0cfb9e67053616ba8a\">Docker Hub<\/a><\/li>\n<li><a href=\"https:\/\/seekingalpha.com\/news\/3682460-robinhood-restores-crypto-trading-after-outage-amid-unprecedented-demand\">Robinhood<\/a><\/li>\n<li><a href=\"https:\/\/www.cnet.com\/news\/twitter-outage-continues-for-some-users-problem-appears-to-be-global\/\">Twitter<\/a><\/li>\n<li><a href=\"https:\/\/reddit.statuspage.io\/incidents\/34tbvq5wdtm2\">Elevated CDN Errors<\/a><\/li>\n<li><a href=\"https:\/\/status.heroku.com\/incidents\/2224\">Heroku<\/a>\n<ul class=\"sreweekly-outage\">\n<li class=\"sreweekly-outage\">Heroku had a series of incidents this week (<a href=\"https:\/\/status.heroku.com\/incidents\/2224\">1<\/a>, <a href=\"https:\/\/status.heroku.com\/incidents\/2225\">2<\/a>, <a href=\"https:\/\/status.heroku.com\/incidents\/2226\">3<\/a>, <a href=\"https:\/\/status.heroku.com\/incidents\/2224\">4<\/a>).<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>SRE WEEKLY<\/p>\n","protected":false},"excerpt":{"rendered":"<p>View on sreweekly.com A message from our sponsor, StackHawk: Are you a ZAP user looking to automate your security testing? Make sure to tune in to ZAPCon After Hours on Tuesday at 8 am PT to see how you can use Jenkins and Zest scripts to automate ZAP. 
http:\/\/sthwk.com\/zapcon-ah Articles Airplane takes off a metric&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-266\/\">Continue reading <span class=\"screen-reader-text\">SRE Weekly Issue #266<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[8],"tags":[],"class_list":["post-298","post","type-post","status-publish","format-standard","hentry","category-sre","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":333,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-279\/","url_meta":{"origin":298,"position":0},"title":"SRE Weekly Issue #279","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: On July 28, ZAP Creator Simon Bennetts is giving a first look at ZAP\u2019s new automation framework. Grab your spot: https:\/\/sthwk.com\/ZAP-Automation Articles Managing the Risk of Cascading Failure This is a presentation by Laura Nolan (with text transcript) all about cascading\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":276,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-259\/","url_meta":{"origin":298,"position":1},"title":"SRE Weekly Issue #259","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Mark your calendars! The first conference for OWASP ZAP users is taking place March 9. 
Get your free ticket to connect with other ZAP users and learn about the project\u2019s roadmap http:\/\/sthwk.com\/zapcon-sreweekly Articles Increment: Reliability This quarter\u2019s Increment issue is about\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":304,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-269\/","url_meta":{"origin":298,"position":2},"title":"SRE Weekly Issue #269","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Tune into ZAPCon After Hours this Tuesday at 8 am PT to learn how to include automated security testing in your builds with ZAP http:\/\/sthwk.com\/after-hours-3 Articles Edgar: Solving Mysteries Faster with Observability We built Edgar to ease this burden, by empowering\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":320,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-275\/","url_meta":{"origin":298,"position":3},"title":"SRE Weekly Issue #275","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Join ZAP Founder & Project Lead Simon Bennetts on June 30 for a live AMA where he will be answering questions on all things open source and AppSec. Register: http:\/\/sthwk.com\/Simon-AMA Articles Practical Guide to SRE: Incident Severity Levels Here\u2019s a take\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":261,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-256\/","url_meta":{"origin":298,"position":4},"title":"SRE Weekly Issue #256","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Register now for the first-ever ZAPCon taking place March 9th. 
The free event will focus on OWASP ZAP and application security best practices. You wont want to miss it! http:\/\/sthwk.com\/zapcon-sre-weekly Articles Slack\u2019s Outage on January 4th 2021 Here\u2019s a blog post\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":343,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-282\/","url_meta":{"origin":298,"position":5},"title":"SRE Weekly Issue #282","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: ICYMI ZAP Creator and Project Lead Simon Bennetts recently unveiled ZAP\u2019s new automation framework. Watch the session and see how it works: https:\/\/sthwk.com\/Automation-Framework Articles A thorough introduction to bpftrace I really need to learn bpftrace, and this article is a great\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/298","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=298"}],"version-history":[{"count":1,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/298\/revisions"}],"predecessor-version":[{"id":412,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/298\/revisions\/412"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=298"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=298"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.c
at\/index.php\/wp-json\/wp\/v2\/tags?post=298"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}