{"id":286,"date":"2021-08-31T14:40:23","date_gmt":"2021-08-31T14:40:23","guid":{"rendered":"https:\/\/fde.cat\/?p=286"},"modified":"2021-08-31T14:40:23","modified_gmt":"2021-08-31T14:40:23","slug":"sre-weekly-issue-262","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-262\/","title":{"rendered":"SRE Weekly Issue #262"},"content":{"rendered":"<p><a href=\"http:\/\/sreweekly.com\/sre-weekly-issue-262\/\" title=\"Permalink to SRE Weekly Issue #262\" class=\"email_only\">View on sreweekly.com<\/a><\/p>\n<div class=\"sreweekly-sponsor-message\">\n<h2>A message from our sponsor, StackHawk:<\/h2>\n<p>Join the Secure Coding Summit to hear from industry-leading AppSec and DevSecOps practitioners, analysts, and visionaries as they share their best pro tips to level up your code security.<br \/>\n<a href=\"http:\/\/sthwk.com\/secure-code-summit\">http:\/\/sthwk.com\/secure-code-summit<\/a><\/p>\n<\/div>\n<h2>Articles<\/h2>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.verica.io\/blog\/four-prerequisites-for-chaos-engineering\/\" target=\"_blank\" rel=\"noopener\">The Prerequisites for Chaos Engineering<\/a><\/div>\n<div class=\"sreweekly-description\">\n<blockquote>\n<p>Chaos Engineering isn\u2019t adding chaos to your systems\u2014it\u2019s seeing the chaos that already exists in your systems.<\/p>\n<\/blockquote>\n<p>Along with four prerequisites, this article also includes 3 myths about chaos engineering that might be making you feel hesitant about starting.<\/p>\n<p><small>Courtney Nash \u2014 Verica<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.transposit.com\/blog\/2020.05.07-managing-on-call-in-a-pandemic\/\" target=\"_blank\" rel=\"noopener\">Managing On-Call in a Pandemic<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This one\u2019s from May of last year. Almost a year on, it\u2019s interesting to see which of these we\u2019ve already implemented.<\/p>\n<p><small>Ashley Roof \u2014 Transposit<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/engineering.indeedblog.com\/blog\/2019\/10\/being-just-reliable-enough\/\" target=\"_blank\" rel=\"noopener\">Being Just Reliable Enough<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>An amusing parable illustrating why not to try to be <em>too<\/em> reliable.<\/p>\n<p><small>Andrew Ford \u2014 Indeed<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.verdict.co.uk\/liar-liar-kremlins-on-fire-google-flatly-contradicts-russian-outage-story\/\" target=\"_blank\" rel=\"noopener\">Google debunks Russian claims that fire was connected to service outage<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>In the Outages section of last week\u2019s issue, you\u2019ll find two unrelated events referenced in this article: one about Russian internet censorship gone awry and another about a major datacenter fire.<\/p>\n<p><small>Eric Johansson \u2014 Verdict<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.blameless.com\/blog\/how-to-analyze-contributing-factors-blamelessly\" target=\"_blank\" rel=\"noopener\">How to Analyze Contributing Factors Blamelessly<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Along with what\u2019s in the title, this article also covers the difference between an RCA and a contributing factors analysis.<\/p>\n<p><small>Emily Arnott \u2014 Blameless<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/engineering.linkedin.com\/blog\/2021\/rethinking-site-capacity-projections-with-capacity-analyzer\" target=\"_blank\" rel=\"noopener\">Rethinking site capacity projections with Capacity Analyzer<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Lots of detail on how LinkedIn is improving their traffic forecasts. Warning\/enticement: math contained within.<\/p>\n<p><small>Deepanshu Mehndiratta \u2014 LinkedIn<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/launchdarkly.com\/blog\/testing-in-production-for-safety-and-sanity\/\" target=\"_blank\" rel=\"noopener\">Testing in Production for Safety and Sanity<\/a><\/div>\n<div class=\"sreweekly-description\">\n<blockquote>\n<p>Everyone is testing in production, some organizations admit and plan for it.<\/p>\n<\/blockquote>\n<p>How to do it right, what can happen if it goes wrong, and how to limit the blast radius.<\/p>\n<p><small>Heidi Waterhouse \u2014 LaunchDarkly<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/github.blog\/2021-03-18-how-we-found-and-fixed-a-rare-race-condition-in-our-session-handling\/\" target=\"_blank\" rel=\"noopener\">How we found and fixed a rare race condition in our session handling<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Remember when GitHub logged you out? Ah, I remember it like it was last week. I mean, the week before. Here\u2019s GitHub\u2019s troubleshooting story about what went wrong.<\/p>\n<p><small>Dirkjan Bussink \u2014 GitHub<\/small><\/p>\n<\/div>\n<\/div>\n<h2>Outages<\/h2>\n<ul class=\"sreweekly-outages\">\n<li><a href=\"https:\/\/status.cloud.google.com\/incident\/cloud-networking\/21006#21006001\">Google Cloud Platform<\/a>\n<ul class=\"sreweekly-outage\">\n<li class=\"sreweekly-outage\">GCP had a major multi-region networking issue, due to a routing glitch. Click through for their followup post.<\/li>\n<\/ul>\n<\/li>\n<li><a href=\"https:\/\/www.latimes.com\/environment\/story\/2021-03-16\/noaa-tsunami-sensors-went-down-ahead-of-10th-anniversary-of-japans-tohoku-disaster\">US National Oceanic and Atmospheric Administration (NOAA)<\/a>\n<ul class=\"sreweekly-outage\">\n<li class=\"sreweekly-outage\">This outage impaired NOAA\u2019s tsunami early warning system.<\/li>\n<\/ul>\n<\/li>\n<li><a href=\"https:\/\/thestarphoenix.com\/pmn\/business-pmn\/facebook-services-suffer-global-outage-with-instagram-down-for-nearly-a-million\">Facebook, Instagram, and WhatsApp<\/a><\/li>\n<li><a href=\"https:\/\/www.express.co.uk\/life-style\/science-technology\/1411375\/TikTok-DOWN-is-TikTok-not-working-right-now-is-TikTok-FYP-broken\">TikTok<\/a><\/li>\n<li><a href=\"https:\/\/reddit.statuspage.io\/incidents\/l9yh2xfs892m\">Elevated error rates<\/a><\/li>\n<li><a href=\"https:\/\/status.azure.com\/en-us\/status\/history\/\">Microsoft Teams and other services<\/a>\n<ul class=\"sreweekly-outage\">\n<li class=\"sreweekly-outage\">Click through for a highly detailed description of what went wrong. I can\u2019t link directly to the incident in question, so you\u2019ll have to scroll down to 3\/15.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>SRE WEEKLY<\/p>\n","protected":false},"excerpt":{"rendered":"<p>View on sreweekly.com A message from our sponsor, StackHawk: Join the Secure Coding Summit to hear from industry-leading AppSec and DevSecOps practitioners, analysts, and visionaries as they share their best pro tips to level up your code security. http:\/\/sthwk.com\/secure-code-summit Articles The Prerequisites for Chaos Engineering Chaos Engineering isn\u2019t adding chaos to your systems\u2014it\u2019s seeing the&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-262\/\">Continue reading <span class=\"screen-reader-text\">SRE Weekly Issue #262<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[8],"tags":[],"class_list":["post-286","post","type-post","status-publish","format-standard","hentry","category-sre","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":525,"url":"https:\/\/fde.cat\/index.php\/2021\/12\/28\/sre-netflix-at-srecon\/","url_meta":{"origin":286,"position":0},"title":"SRE Netflix at SRECon","date":"December 28, 2021","format":"video","excerpt":"190 Countries and 5 CORE SREs by Jonah Horowitz How does Netflix scale SRE? How do we manage over 70 million customers around the world without a 24\/7 operations center? With tens of thousands of Linux instances in a distributed system architecture, and thousands of daily production changes, it's an\u2026","rel":"","context":"In &quot;External&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":303,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-268\/","url_meta":{"origin":286,"position":1},"title":"SRE Weekly Issue #268","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Join StackHawk Tuesday May 4 at 9 am PT for a hands-on technical workshop! By the end of the session, you will have three types of security testing running in your GitHub pipeline. Register: http:\/\/sthwk.com\/technical-workshop Articles Manageable On-Call for Companies without\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":687,"url":"https:\/\/fde.cat\/index.php\/2023\/03\/06\/sre-weekly-issue-362\/","url_meta":{"origin":286,"position":2},"title":"SRE Weekly Issue #362","date":"March 6, 2023","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly\u00a0\ud83d\ude92. Rootly automates manual tasks like creating an incident channel, Jira ticket and Zoom rooms, inviting responders, creating statuspage updates, postmortem timelines and more. Want to see why companies like Canva and Grammarly love us?:\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":304,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-269\/","url_meta":{"origin":286,"position":3},"title":"SRE Weekly Issue #269","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Tune into ZAPCon After Hours this Tuesday at 8 am PT to learn how to include automated security testing in your builds with ZAP http:\/\/sthwk.com\/after-hours-3 Articles Edgar: Solving Mysteries Faster with Observability We built Edgar to ease this burden, by empowering\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":324,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-276\/","url_meta":{"origin":286,"position":4},"title":"SRE Weekly Issue #276","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Get ready for some GraphQL! Tune in this Tuesday, June 29 at 9 AM PT for an automated GraphQL security testing learning lab. Register: http:\/\/sthwk.com\/graphql-learning-lab Articles @GergelyOrosz on blaming the intern HBO accidentally sent an email to a bunch of people,\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":255,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-252\/","url_meta":{"origin":286,"position":5},"title":"SRE Weekly Issue #252","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Interested in how you can automate application security testing with GitHub Actions? Check out this on demand webinar from StackHawk and Snyk and see how simple it is to get started. https:\/\/sthwk.com\/stackhawk-snyk Articles Building On-Call Culture at GitHub Their on-call started\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/286","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=286"}],"version-history":[{"count":1,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/286\/revisions"}],"predecessor-version":[{"id":424,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/286\/revisions\/424"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=286"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=286"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=286"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}