{"id":529,"date":"2022-01-10T02:50:07","date_gmt":"2022-01-10T02:50:07","guid":{"rendered":"https:\/\/fde.cat\/index.php\/2022\/01\/10\/sre-weekly-issue-304\/"},"modified":"2022-01-10T02:50:07","modified_gmt":"2022-01-10T02:50:07","slug":"sre-weekly-issue-304","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2022\/01\/10\/sre-weekly-issue-304\/","title":{"rendered":"SRE Weekly Issue #304"},"content":{"rendered":"<p><a href=\"https:\/\/sreweekly.com\/sre-weekly-issue-304\/\" title=\"Permalink to SRE Weekly Issue #304\" class=\"email_only\">View on sreweekly.com<\/a><\/p>\n<div class=\"sreweekly-sponsor-message\">\n<h2>A message from our sponsor, <a href=\"https:\/\/rootly.com\/demo\/?utm_source=sreweekly\">Rootly<\/a>:<\/h2>\n<p>Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt):<br \/>\n<a href=\"https:\/\/rootly.com\/demo\/?utm_source=sreweekly\">https:\/\/rootly.com\/demo\/?utm_source=sreweekly<\/a><\/p>\n<\/div>\n<h2>Articles<\/h2>\n<div class=\"wp-block-group\">\n<div class=\"wp-block-group__inner-container\">\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/ably.com\/blog\/channel-global-decoupling\" target=\"_blank\" rel=\"noopener\">Channel global decoupling for region discovery<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Ably processes a lot of messages, so when they have to redesign a core part of their architecture, it gets pretty interesting.<\/p>\n<p>\u00a0\u00a0Simon Woolf \u2014 Ably<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/flyingbarron.medium.com\/the-james-webb-space-telescope-making-300-points-of-failure-reliable-db669810a9d8\" target=\"_blank\" rel=\"noopener\">The James Webb Space Telescope\u200a\u2014\u200amaking 300 points of failure reliable<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>If you ask any Site Reliability or DevOps engineer how they feel about a deployment plan with over 300 single points of failure, you\u2019d see a lot of nauseous faces and an outbreak of nervous tics!<\/p>\n<p>Nevertheless, that was the best design.  Read on to find out why.<\/p>\n<p>\u00a0\u00a0Robert Barron<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/slack.engineering\/what-happened-during-slacks-dnssec-rollout\/\" target=\"_blank\" rel=\"noopener\">The Case of the Recursive Resolvers<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Slack had three separate incidents while trying to deploy DNSSEC for slack.com.  This article goes into deep detail on what went wrong each time and what they learned.<\/p>\n<p>Yes, it was an oversight that we did not test a domain with a wildcard record before attempting slack.com \u2014 learn from our mistakes!<\/p>\n<p>\u00a0\u00a0Rafael Elvira and Laura Nolan \u2014 Slack<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.blameless.com\/sre\/building-an-sre-team-with-specialization\" target=\"_blank\" rel=\"noopener\">Building an SRE Team with Specialization<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>The specializations outlined in this article include:<\/p>\n<p>The Educator<br \/>\nThe SLO Guard<br \/>\nInfrastructure architect<br \/>\nIncident response leader<\/p>\n<p>\u00a0\u00a0Emily Arnott \u2014 Blameless<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"http:\/\/highscalability.com\/blog\/2022\/1\/3\/designing-whatsapp.html\" target=\"_blank\" rel=\"noopener\">Designing\u00a0WhatsApp<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>If you had to design a WhatsApp today to support its current load, how would you go about it?  Here\u2019s one possible design.<\/p>\n<p>\u00a0\u00a0Ankit Sirmorya \u2014 High Scalability<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/jvns.ca\/blog\/2022\/01\/05\/why-might-you-run-your-own-dns-server-\/\" target=\"_blank\" rel=\"noopener\">Why might you run your own DNS server?<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Yesterday I asked on Twitter why you might want to run your own DNS servers, and I got a lot of great answers that I wanted to summarize here.<\/p>\n<p>\u00a0\u00a0Julia Evans<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.pageittothelimit.com\/the-void-with-courtney-nash\/\" target=\"_blank\" rel=\"noopener\">The VOID with Courtney Nash<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>In this podcast interview, find out more about why Courtney Nash created the VOID and how posting an incident report can benefit your company.  Transcript available.<\/p>\n<p>\u00a0\u00a0Mandy Walls (with guest Courtney Nash) \u2014 Page it to the Limit<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.honeycomb.io\/blog\/why-intuitive-troubleshooting-stopped-working\/\" target=\"_blank\" rel=\"noopener\">Why Intuitive Troubleshooting Has Stopped Working for You<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Drawing on Cynefin, this article explains why debugging by feel and guesswork won\u2019t suffice anymore; we need to be methodical.<\/p>\n<p>\u00a0\u00a0Pete Hodgson \u2014 Honeycomb<\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<h2>Outages<\/h2>\n<p><a href=\"https:\/\/cryptonews.com\/news\/solana-repotedly-went-down-again-after-ddos-attack.htm\">Solana<\/a><br \/>\n<a href=\"https:\/\/status.finalsite.com\/incidents\/nfg7n13v6f0d\">Finalsite<\/a><br \/>\n<a href=\"https:\/\/economictimes.indiatimes.com\/tech\/technology\/flipkart-website-app-down-for-some-users-in-india\/articleshow\/88661753.cms\">Flipkart<\/a><br \/>\nSRE WEEKLY<\/p>","protected":false},"excerpt":{"rendered":"<p>View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles Channel global decoupling for region&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2022\/01\/10\/sre-weekly-issue-304\/\">Continue reading <span class=\"screen-reader-text\">SRE Weekly Issue #304<\/span><\/a><\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[8],"tags":[],"class_list":["post-529","post","type-post","status-publish","format-standard","hentry","category-sre","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":543,"url":"https:\/\/fde.cat\/index.php\/2022\/02\/21\/sre-weekly-issue-310\/","url_meta":{"origin":529,"position":0},"title":"SRE Weekly Issue #310","date":"February 21, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":579,"url":"https:\/\/fde.cat\/index.php\/2022\/05\/30\/sre-weekly-issue-324\/","url_meta":{"origin":529,"position":1},"title":"SRE Weekly Issue #324","date":"May 30, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly lego set): https:\/\/rootly.com\/demo\/\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":535,"url":"https:\/\/fde.cat\/index.php\/2022\/01\/24\/sre-weekly-issue-306\/","url_meta":{"origin":529,"position":2},"title":"SRE Weekly Issue #306","date":"January 24, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":540,"url":"https:\/\/fde.cat\/index.php\/2022\/02\/07\/sre-weekly-issue-308\/","url_meta":{"origin":529,"position":3},"title":"SRE Weekly Issue #308","date":"February 7, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":537,"url":"https:\/\/fde.cat\/index.php\/2022\/01\/31\/sre-weekly-issue-307\/","url_meta":{"origin":529,"position":4},"title":"SRE Weekly Issue #307","date":"January 31, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":546,"url":"https:\/\/fde.cat\/index.php\/2022\/03\/07\/sre-weekly-issue-312\/","url_meta":{"origin":529,"position":5},"title":"SRE Weekly Issue #312","date":"March 7, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/529","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=529"}],"version-history":[{"count":0,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/529\/revisions"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=529"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=529"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=529"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}