{"id":280,"date":"2021-08-31T14:40:23","date_gmt":"2021-08-31T14:40:23","guid":{"rendered":"https:\/\/fde.cat\/?p=280"},"modified":"2021-08-31T14:40:23","modified_gmt":"2021-08-31T14:40:23","slug":"sre-weekly-issue-260","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-260\/","title":{"rendered":"SRE Weekly Issue #260"},"content":{"rendered":"<p><a href=\"http:\/\/sreweekly.com\/sre-weekly-issue-260\/\" title=\"Permalink to SRE Weekly Issue #260\" class=\"email_only\">View on sreweekly.com<\/a><\/p>\n<div class=\"sreweekly-sponsor-message\">\n<h2>A message from our sponsor, StackHawk:<\/h2>\n<p>Check out this guide to modern dynamic application security testing to learn how it works and what to look for in tooling.<br \/>\n<a href=\"http:\/\/sthwk.com\/dynamic-appsec-overview\">http:\/\/sthwk.com\/dynamic-appsec-overview<\/a><\/p>\n<\/div>\n<h2>Articles<\/h2>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/increment.com\/reliability\/resilience-engineering-david-woods\/\" target=\"_blank\" rel=\"noopener\">[Increment: Reliability] Interview: Dr. David D. Woods<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>People throw around \u201cresiliency\u201d quite often when they mean \u201creliability\u201d or \u201chigh availability\u201d. Dr. Woods sets the record straight.<\/p>\n<p><small>Ipsita Agarwal \u2014 Increment<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/increment.com\/reliability\/yelp-traffic-failover-strategy\/\" target=\"_blank\" rel=\"noopener\">[Increment: Reliability] The process: Implementing Yelp\u2019s failover strategy<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>A key part of their strategy is to keep their service running at 50% capacity or less, allowing them to lose a datacenter without overloading the remaining datacenter.<\/p>\n<p><small>Mathieu Frappier, Dorothy Jung, and Qui Nguyen \u2014 Increment<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/increment.com\/reliability\/adaptive-capacity-incident-response\/\" target=\"_blank\" rel=\"noopener\">[Increment: Reliability] On adaptive capacity in incident response<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>In issue <a href=\"https:\/\/sreweekly.com\/sre-weekly-issue-236\/\">#236<\/a>, I linked to an excellent paper by Dr. Richard Cook and Beth Long about engineering resilience in incident response. Now they\u2019re back, teaming up with John Allspaw to summarize and expand on that paper!<\/p>\n<p><small>John Allspaw, Beth Adele Long, and Dr. Richard Cook \u2014 Increment<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.verica.io\/security-chaos-engineering-how-to-security-differently\/\" target=\"_blank\" rel=\"noopener\">Security Chaos Engineering: How to Security Differently<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>A quick <code>s\/security\/reliability\/g<\/code> and this is an SRE article; the same principles apply to both fields.<\/p>\n<p><small>Aaron Rinehart \u2014 Verica<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.blameless.com\/blog\/how-flight-controllers-were-the-first-sres\" target=\"_blank\" rel=\"noopener\">SRE2AUX: How Flight Controllers were the first SREs<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>How can we apply the tenets and principles of NASA mission controllers to our SRE work?<\/p>\n<p><small>Geoff White \u2014 Blameless<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.blameless.com\/blog\/sre-as-organizational-transformation-lessons-from-activist-organizers\" target=\"_blank\" rel=\"noopener\">SRE as Organizational Transformation: Lessons from Activist Organizers<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Genius idea: we can take our lead from activists as we try to win over our organization to adopt SRE principles.<\/p>\n<p><small>Chris Hendrix \u2014 Blameless<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/dropbox.tech\/infrastructure\/atlas--our-journey-from-a-python-monolith-to-a-managed-platform\" target=\"_blank\" rel=\"noopener\">Atlas: Our journey from a Python monolith to a managed platform<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This insightful observation caught my eye:<\/p>\n<blockquote>\n<p>It\u2019s unnecessary overhead for a product team to plan capacity, set up good alerts and multihoming (automatically running in multiple data centers) for small, simple functionality.<\/p>\n<\/blockquote>\n<p><small>Naphat Sanguansin and Utsav Shah \u2014 Dropbox<\/small><\/p>\n<\/div>\n<\/div>\n<h2>Outages<\/h2>\n<ul class=\"sreweekly-outages\">\n<li><a href=\"https:\/\/9to5google.com\/2021\/03\/03\/fitbit-app-down-outage-march21\/\">Fitbit<\/a><\/li>\n<li><a href=\"https:\/\/www.nme.com\/news\/tv\/thousands-of-netflix-users-experience-outages-2893304\">Netflix<\/a><\/li>\n<li><a href=\"https:\/\/www.joblo.com\/movie-news\/wandavision-finale-cause-disney-plus-crash\">Disney+<\/a>\n<ul class=\"sreweekly-outage\">\n<li class=\"sreweekly-outage\">This week was the Wandavision finale.<\/li>\n<\/ul>\n<\/li>\n<li><a href=\"https:\/\/status.fastly.com\/incidents\/nsbq47s47cf5\">Fastly<\/a>\n<ul class=\"sreweekly-outage\">\n<li class=\"sreweekly-outage\"><em><small>Fastly is my employer.<\/small><\/em><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>SRE WEEKLY<\/p>\n","protected":false},"excerpt":{"rendered":"<p>View on sreweekly.com A message from our sponsor, StackHawk: Check out this guide to modern dynamic application security testing to learn how it works and what to look for in tooling. http:\/\/sthwk.com\/dynamic-appsec-overview Articles [Increment: Reliability] Interview: Dr. David D. Woods People throw around \u201cresiliency\u201d quite often when they mean \u201creliability\u201d or \u201chigh availability\u201d. Dr. Woods&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-260\/\">Continue reading <span class=\"screen-reader-text\">SRE Weekly Issue #260<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[8],"tags":[],"class_list":["post-280","post","type-post","status-publish","format-standard","hentry","category-sre","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":282,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-261\/","url_meta":{"origin":280,"position":0},"title":"SRE Weekly Issue #261","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Join Snyk and StackHawk on March 18 as they walk through how to use Software Composition Analysis (SCA) and Dynamic Application Security Testing (DAST) in CI\/CD to ship more secure applications. http:\/\/sthwk.com\/snyk-stackhawk-webinar Articles What Do Fighter Pilots and Incident Management Have\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":276,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-259\/","url_meta":{"origin":280,"position":1},"title":"SRE Weekly Issue #259","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Mark your calendars! The first conference for OWASP ZAP users is taking place March 9. Get your free ticket to connect with other ZAP users and learn about the project\u2019s roadmap http:\/\/sthwk.com\/zapcon-sreweekly Articles Increment: Reliability This quarter\u2019s Increment issue is about\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":291,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-263\/","url_meta":{"origin":280,"position":2},"title":"SRE Weekly Issue #263","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: You can utilize Swagger Docs in security testing to drive more thorough and accurate vulnerability scans of your APIs. Learn how: http:\/\/sthwk.com\/swagger-api-testing Articles [Increment: Reliability] Tracing a path to observability They make a really clear case for why traditional metrics and\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":653,"url":"https:\/\/fde.cat\/index.php\/2022\/11\/21\/sre-weekly-issue-348\/","url_meta":{"origin":280,"position":3},"title":"SRE Weekly Issue #348","date":"November 21, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly\u00a0\ud83d\ude92. Rootly automates manual tasks like creating an incident channel, Jira ticket and Zoom rooms, inviting responders, creating statuspage updates, postmortem timelines and more. Want to see why companies like Canva and Grammarly love us?:\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":737,"url":"https:\/\/fde.cat\/index.php\/2023\/07\/23\/sre-weekly-issue-382\/","url_meta":{"origin":280,"position":4},"title":"SRE Weekly Issue #382","date":"July 23, 2023","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Eliminate the anxiety around declaring an incident for nebulous problems by introducing a triage phase into your incident management process. Our latest blog posts dives into why the triage phase is so important, and how you can automate yours with Rootly.\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":297,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-265\/","url_meta":{"origin":280,"position":5},"title":"SRE Weekly Issue #265","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Join StackHawk and WhiteSource tomorrow morning to learn about automated security testing in the DevOps pipeline. With automated dynamic testing and software composition analysis, you can be sure you\u2019re shipping secure APIs and applications. Grab your spot: http:\/\/sthwk.com\/stackhawk-whitesource Articles Insights into\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/280","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=280"}],"version-history":[{"count":1,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/280\/revisions"}],"predecessor-version":[{"id":430,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/280\/revisions\/430"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=280"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=280"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=280"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}