{"id":324,"date":"2021-08-31T14:39:51","date_gmt":"2021-08-31T14:39:51","guid":{"rendered":"https:\/\/fde.cat\/?p=324"},"modified":"2021-08-31T14:39:51","modified_gmt":"2021-08-31T14:39:51","slug":"sre-weekly-issue-276","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-276\/","title":{"rendered":"SRE Weekly Issue #276"},"content":{"rendered":"<p><a href=\"https:\/\/sreweekly.com\/sre-weekly-issue-276\/\" title=\"Permalink to SRE Weekly Issue #276\" class=\"email_only\">View on sreweekly.com<\/a><\/p>\n<div class=\"sreweekly-sponsor-message\">\n<h2>A message from our sponsor, StackHawk:<\/h2>\n<p>Get ready for some GraphQL! Tune in this Tuesday, June 29 at 9 AM PT for an automated GraphQL security testing learning lab. Register:<br \/>\n<a href=\"http:\/\/sthwk.com\/graphql-learning-lab\">http:\/\/sthwk.com\/graphql-learning-lab<\/a><\/p>\n<\/div>\n<h2>Articles<\/h2>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/mobile.twitter.com\/gergelyorosz\/status\/1405865948716929029?s=21\">@GergelyOrosz on blaming the intern<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>HBO accidentally sent an email to a bunch of people, and they tweeted (jokingly?) blaming their intern. This is a link to a short, thoughtful response thread.<\/p>\n<p>Gergely Orosz<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/bunny.net\/blog\/the-stack-overflow-of-death-dns-collapse\/\">The stack overflow of death. How we lost DNS and what we\u2019re doing to prevent this in the future.<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This is the story of the Bunny CDN outage linked below. 
Great read, thanks folks!<\/p>\n<p>Dejan Grofelnik Pelzel \u2014 Bunny<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/ably.com\/blog\/8-fallacies-of-distributed-computing\">Navigating the 8 fallacies of distributed computing<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>There\u2019s never a bad time to review the fallacies of distributed computing. This article introduces them with examples and discussion of each.<\/p>\n<p>Alex Diaconu \u2014 Ably<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/rootly.io\/blog\/7-essential-tools-for-sres\">7 Essential Tools for SREs<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>These aren\u2019t specific tools, but rather 7 classes of tools (with examples). They are:<\/p>\n<p>Chaos engineering<br \/>\nMonitoring and alerting<br \/>\nObservability<br \/>\nPaging tools<br \/>\nSLO management<br \/>\nInfrastructure-as-Code (and everything-as-code)<br \/>\nAutomated incident response<\/p>\n<p>Quentin Rousseau \u2014 Rootly<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/surfingcomplexity.blog\/2021\/06\/20\/designing-like-a-joint-cognitive-system\/\">Designing like a joint cognitive system<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Design is interpretive. 
We have to find common ground before we can even start to create a design, but finding that common ground is <em>part<\/em> of the design.<\/p>\n<p>For example, we think of building codes as being precise, but when applied to new situations, they are ambiguous, and the engineers must make a judgment about how to apply them.<\/p>\n<p>Lorin Hochstein<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.blameless.com\/blog\/resilience-in-action-e8-vanessa-yiu-on-crafting-enterprise-architecture\">Resilience in Action E8: Vanessa Yiu on Crafting Enterprise Architecture<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This starts with a really neat moment in which the interviewer asks Yiu to talk about lessons from her jewelry-making hobby that she applies to SRE.<\/p>\n<p>Kurt Andersen<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.reddit.com\/r\/RedditEng\/comments\/o4y4dq\/rwallstreetbets_incident_anthology_reddits_open\/\">r\/WallStreetBets Incident Anthology: Reddit\u2019s Open Systems<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>When Gamestop\u2019s stock shot through the roof earlier this year, Reddit\u2019s traffic did too. 
This is the first article in a short series by Reddit\u2019s SRE team on how they handled the influx.<\/p>\n<p>This article is about how user actions affected their systems in unexpected ways, and how they responded.<\/p>\n<p>Courtney Wang \u2014 Reddit<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/dubrie.medium.com\/sre-cultural-values-a0073b475183\">SRE Cultural Values<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Recently in our Site Reliability Engineering organization in Azure, we established a set of cultural values that we hold ourselves and each other accountable to.<\/p>\n<p>Bill Johnson \u2014 Microsoft<\/p>\n<\/div>\n<\/div>\n<h2>Outages<\/h2>\n<p><a href=\"https:\/\/arstechnica.com\/gadgets\/2021\/06\/mass-data-wipe-in-my-book-devices-prompts-warning-from-western-digital\/\">Western Digital \u201cMy Book Live\u201d hard drives<\/a><br \/>\n<a href=\"https:\/\/www.pymnts.com\/amazon\/2021\/amazon-website-back-online-following-brief-outage\/\">Amazon Prime Video and Alexa<\/a><br \/>\n<a href=\"https:\/\/www.chemistanddruggist.co.uk\/news\/pharmacies-forced-revert-paper-system-pharmoutcomes-outage\">PharmOutcomes<\/a><\/p>\n<p>PharmOutcomes is a SaaS used by pharmacies.<\/p>\n<p><a href=\"https:\/\/www.9news.com.au\/national\/commbank-app-and-merchant-terminals-go-down-frustrating-customers\/56ff8d82-168f-43b8-ac99-784e459bb8b5\">Commonwealth Bank<\/a><br \/>\n<a href=\"https:\/\/medium.statuspage.io\/incidents\/f3rg84bpv2zn\">medium<\/a><\/p>\n<p>I\u2019ve gotten a few 500s from Medium while trying to review articles last week and this week. 
Maybe it\u2019s this incident on their status page?<\/p>\n<p><a href=\"https:\/\/status.bunnycdn.com\/incidents\/c7rdtvhz1b7n\">Bunny (CDN)<\/a><br \/>\n<a href=\"https:\/\/reddit.statuspage.io\/incidents\/3710db57k6n8\">reddit<\/a><\/p>\n<p>This post on their status site says \u201cAPI errors\u201d, but I saw rumblings that suggested that reddit itself was down.<\/p>\n<p>SRE WEEKLY<\/p>\n","protected":false},"excerpt":{"rendered":"<p>View on sreweekly.com A message from our sponsor, StackHawk: Get ready for some GraphQL! Tune in this Tuesday, June 29 at 9 AM PT for an automated GraphQL security testing learning lab. Register: http:\/\/sthwk.com\/graphql-learning-lab Articles @GergelyOrosz on blaming the intern HBO accidentally sent an email to a bunch of people, and they tweeted (jokingly?) blaming&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-276\/\">Continue reading <span class=\"screen-reader-text\">SRE Weekly Issue #276<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[8],"tags":[],"class_list":["post-324","post","type-post","status-publish","format-standard","hentry","category-sre","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":318,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-274\/","url_meta":{"origin":324,"position":0},"title":"SRE Weekly Issue #274","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Join the GraphQL Security Testing Learning Lab on June 29 at 9 AM PT. Learn how to run automated security testing against your GraphQL APIs so you can find and fix vulnerabilities fast. 
http:\/\/sthwk.com\/graphql-learning-lab Articles Chicken Soup for the SLO The\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":254,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-253\/","url_meta":{"origin":324,"position":1},"title":"SRE Weekly Issue #253","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: How do you know if your GraphQL API is secure? Watch StackHawk CSO Scott Gerlach walk through how to run application security tests for GraphQL-backed apps. http:\/\/sthwk.com\/graphql-webinar Articles May 30 SSL incident TLS can be such a headache. This was an\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":746,"url":"https:\/\/fde.cat\/index.php\/2023\/08\/14\/sre-weekly-issue-385\/","url_meta":{"origin":324,"position":2},"title":"SRE Weekly Issue #385","date":"August 14, 2023","format":false,"excerpt":"View on sreweekly.com Many apologies to Matt Cooper at GitHub, who is the actual author of the article Scaling Merge-ort Across GitHub from last week. Sorry for the mis-credit, Matt! A message from our sponsor, Rootly: When incidents impact your customers, failing to communicate with them effectively can erode trust\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":499,"url":"https:\/\/fde.cat\/index.php\/2021\/11\/01\/the-journey-of-building-a-scalable-api\/","url_meta":{"origin":324,"position":3},"title":"The Journey of Building a Scalable API","date":"November 1, 2021","format":false,"excerpt":"APIs are an essential tool to allow partners, developers, and applications to consume, communicate, or build on top of the various capabilities your microservices provide. 
Building a high quality API that can scale and perform with the business ecosystem is not easy and requires putting thought and planning into everything, from\u2026","rel":"","context":"In &quot;Technology&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":269,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-257\/","url_meta":{"origin":324,"position":4},"title":"SRE Weekly Issue #257","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Keeping your APIs secure requires thoughtful design and testing. Learn how to protect your REST, SOAP and GraphQL APIs from security vulnerabilities with StackHawk http:\/\/sthwk.com\/api-protection Articles Sometimes alerts have inobvious reasons for existing This one really got me thinking. Make sure\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":622,"url":"https:\/\/fde.cat\/index.php\/2022\/08\/22\/sre-weekly-issue-335\/","url_meta":{"origin":324,"position":5},"title":"SRE Weekly Issue #335","date":"August 22, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. 
Book a demo (+ get a snazzy Rootly lego set): https:\/\/rootly.com\/demo\/\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/324","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=324"}],"version-history":[{"count":1,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/324\/revisions"}],"predecessor-version":[{"id":386,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/324\/revisions\/386"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=324"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=324"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=324"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}