{"id":300,"date":"2021-08-31T14:40:03","date_gmt":"2021-08-31T14:40:03","guid":{"rendered":"https:\/\/fde.cat\/?p=300"},"modified":"2021-08-31T14:40:03","modified_gmt":"2021-08-31T14:40:03","slug":"sre-weekly-issue-267","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-267\/","title":{"rendered":"SRE Weekly Issue #267"},"content":{"rendered":"<p><a href=\"http:\/\/sreweekly.com\/sre-weekly-issue-267\/\" title=\"Permalink to SRE Weekly Issue #267\" class=\"email_only\">View on sreweekly.com<\/a><\/p>\n<div class=\"sreweekly-sponsor-message\">\n<h2>A message from our sponsor, StackHawk:<\/h2>\n<p>Serverless doesn\u2019t mean secure. Use modern security testing tools to assess serverless applications for vulnerabilities during development.<br \/>\n<a href=\"http:\/\/sthwk.com\/serverless\">http:\/\/sthwk.com\/serverless<\/a><\/p>\n<\/div>\n<h2>Articles<\/h2>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/tech.ebayinc.com\/engineering\/sre-case-study-mysterious-traffic-imbalance\/\">SRE Case Study: Mysterious Traffic Imbalance<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Yet more proof that DNS behavior varies way more than is obvious at first glance. Who the heck thought longest common prefix matching was a good idea?<\/p>\n<p>Charles Li \u2014 eBay<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/stripe.com\/blog\/canonical-log-lines\">Fast and flexible observability with canonical log lines<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>The application may log multiple lines during the lifecycle of a request. Stripe has found it invaluable to also log one final line with a fully summary of the request.<\/p>\n<p>Brandur Leach \u2014 Stripe<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/static.googleusercontent.com\/media\/www.google.com\/en\/\/appsstatus\/ir\/4663s0fuvyqu7fg.pdf\">Google Incident Report \u2014 April 12, 2021<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This is a followup with more detail on the G-Suite outage I reported here last week. A database issue caused two separate outages.<\/p>\n<p>Google<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.getcortexapp.com\/post\/the-top-3-mistakes-companies-make-with-slos-slas-and-slis\">The top 3 mistakes companies make with SLOs, SLAs, and SLIs<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Really great advice about 3 common pitfalls in implementing SL*s.<\/p>\n<p>Cortex<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/resilienceroundup.com\/issues\/going-solid-a-model-of-system-dynamics-and-consequences-for-patient-safety\/\">Going solid: a model of system dynamics and consequences for patient safety \u2013 Resilience Roundup<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This research paper explores the <em>marginal boundary<\/em>, a set of conditions beyond which a system enters a different operating mode and an accident is much more likely. It discusses the concept of coupling between seemingly unrelated parts of the system and shows how economic incentives can push a system toward this boundary.<\/p>\n<p>Dr. Richard Cook and Jens Rasmussen (Original paper)<\/p>\n<p>Thai Wood \u2014 Resilience Roundup (summary)<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/blog.catchpoint.com\/2021\/04\/19\/vodafone-idea-bgp-leak-global-routing-system-must-implement-manrs\/?utm_campaign=BGP_Monitoring&amp;utm_medium=email&amp;_hsenc=p2ANqtz-9wk0LlV1LPmkKDB6ZzTZsKSjXQo6GjN0SB9NiueXZukAzdzELZQ02Tg9X2hzAYp8RjCYXjCJE_mvrfKMoqNIowK-O6wQ&amp;_hsmi=122704848&amp;utm_content=122702980&amp;utm_source=hs_email&amp;hsCtaTracking=50b29aee-04e9-446f-8f27-bfd5feedd7f8%7Ca85fcd1c-ea8d-48a6-9f47-ab39f2f1a86a\">Vodafone Idea BGP Leak \u2013 Global Routing System Must Implement MANRS<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This is an analysis of a recent BGP leak with a discussion about how the impact from such events can be mitigated through emerging best practices.<\/p>\n<p>Alessandro Improta and Luca Sani \u2014 Catchpoint<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/developers.soundcloud.com\/blog\/how-to-successfully-hand-over-systems\">How to Successfully Hand Over Systems<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>How do you hand over ownership of a system, transferring enough knowledge that the new owners can maintain its availability and reliability successfully?<\/p>\n<p>Aleksandra Gavrilovska \u2014 SoundCloud<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/shopify.engineering\/resiliency-planning-for-high-traffic-events\">Resiliency Planning for High-Traffic Events<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Shopify works toward Black Friday \/ Cyber Monday all year long, through a combination of load testing, failure mode analysis, game days, and incident analysis.<\/p>\n<p>Ryan McIlmoyl \u2014 Shopify<\/p>\n<\/div>\n<\/div>\n<h2>Outages<\/h2>\n<p><a href=\"https:\/\/piunikaweb.com\/2021\/04\/20\/microsoft-azure-down-throwing-error-503-outage-is-known-and-under-investigation\/\">Microsoft Azure web portal<\/a><br \/>\n<a href=\"https:\/\/www.bleepingcomputer.com\/news\/security\/exchange-online-down-microsoft-365-outage-affects-email-delivery\/\">Microsoft 365<\/a><br \/>\n<a href=\"https:\/\/discord.statuspage.io\/incidents\/gl4j21rcfq3d\">Discord<\/a><br \/>\n<a href=\"https:\/\/www.newser.com\/story\/305277\/somebody-bought-argentinas-google-domain-for-under-3.html\">google.com.ar<\/a><\/p>\n<p>This one\u2019s interesting. A random person was able to buy the domain name google.com.ar, despite the fact that its registration had not expired.<\/p>\n<p>SRE WEEKLY<\/p>\n","protected":false},"excerpt":{"rendered":"<p>View on sreweekly.com A message from our sponsor, StackHawk: Serverless doesn\u2019t mean secure. Use modern security testing tools to assess serverless applications for vulnerabilities during development. http:\/\/sthwk.com\/serverless Articles SRE Case Study: Mysterious Traffic Imbalance Yet more proof that DNS behavior varies way more than is obvious at first glance. Who the heck thought longest common&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-267\/\">Continue reading <span class=\"screen-reader-text\">SRE Weekly Issue #267<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[8],"tags":[],"class_list":["post-300","post","type-post","status-publish","format-standard","hentry","category-sre","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":817,"url":"https:\/\/fde.cat\/index.php\/2024\/01\/29\/sre-weekly-issue-409\/","url_meta":{"origin":300,"position":0},"title":"SRE Weekly Issue #409","date":"January 29, 2024","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, FireHydrant: It\u2019s time for a new world of alerting tools that prioritize engineer well-being and efficiency. The future lies in intelligent systems that are compatible with real life and use conditional rules to adapt and refine thresholds, reducing alert fatigue. https:\/\/firehydrant.com\/blog\/the-alert-fatigue-dilemma-a-call-for-change-in-how-we-manage-on-call\/ Executing\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":721,"url":"https:\/\/fde.cat\/index.php\/2023\/06\/05\/sre-weekly-issue-375\/","url_meta":{"origin":300,"position":1},"title":"SRE Weekly Issue #375","date":"June 5, 2023","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Curious how companies like Figma, Tripadvisor, and 100s of others leverage Rootly to manage incidents in Slack and unlock instant best practices? Check out this lightning demo: https:\/\/www.loom.com\/share\/051c4be0425a436e888dc0c3690855ad Articles How can you land 5 kilometers above the Moon? An in-depth analysis\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":875,"url":"https:\/\/fde.cat\/index.php\/2024\/06\/10\/serverless-jupyter-notebooks-at-meta\/","url_meta":{"origin":300,"position":2},"title":"Serverless Jupyter Notebooks at Meta","date":"June 10, 2024","format":false,"excerpt":"At Meta, Bento, our internal Jupyter notebooks platform, is a popular tool that allows our engineers to mix code, text, and multimedia in a single document. Use cases run the entire spectrum from what we call \u201clite\u201d workloads that involve simple prototyping to heavier and more complex machine learning workflows.\u2026","rel":"","context":"In &quot;Technology&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":543,"url":"https:\/\/fde.cat\/index.php\/2022\/02\/21\/sre-weekly-issue-310\/","url_meta":{"origin":300,"position":3},"title":"SRE Weekly Issue #310","date":"February 21, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":815,"url":"https:\/\/fde.cat\/index.php\/2024\/01\/22\/sre-weekly-issue-408\/","url_meta":{"origin":300,"position":4},"title":"SRE Weekly Issue #408","date":"January 22, 2024","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, FireHydrant: It\u2019s time for a new world of alerting tools that prioritize engineer well-being and efficiency. The future lies in intelligent systems that are compatible with real life and use conditional rules to adapt and refine thresholds, reducing alert fatigue. https:\/\/firehydrant.com\/blog\/the-alert-fatigue-dilemma-a-call-for-change-in-how-we-manage-on-call\/ Tell\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":577,"url":"https:\/\/fde.cat\/index.php\/2022\/05\/16\/sre-weekly-issue-322\/","url_meta":{"origin":300,"position":5},"title":"SRE Weekly Issue #322","date":"May 16, 2022","format":false,"excerpt":"View on sreweekly.com Bit of a short issue this week. This morning, I stepped on my phone, crushing it mightily beneath my bootheel. Unfortunately a lot of my automation for reviewing articles is on there\u2026 thank goodness I have functioning backups. A message from our sponsor, Rootly: Manage incidents directly\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/300","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=300"}],"version-history":[{"count":1,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/300\/revisions"}],"predecessor-version":[{"id":410,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/300\/revisions\/410"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=300"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=300"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=300"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}