{"id":498,"date":"2021-11-01T01:06:07","date_gmt":"2021-11-01T01:06:07","guid":{"rendered":"https:\/\/fde.cat\/index.php\/2021\/11\/01\/sre-weekly-issue-294\/"},"modified":"2021-11-01T01:06:07","modified_gmt":"2021-11-01T01:06:07","slug":"sre-weekly-issue-294","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2021\/11\/01\/sre-weekly-issue-294\/","title":{"rendered":"SRE Weekly Issue #294"},"content":{"rendered":"<p><a href=\"https:\/\/sreweekly.com\/sre-weekly-issue-294\/\" title=\"Permalink to SRE Weekly Issue #294\" class=\"email_only\">View on sreweekly.com<\/a><\/p>\n<div class=\"sreweekly-sponsor-message\">\n<h2>A message from our sponsor, <a href=\"https:\/\/rootly.com\/?utm_source=sreweekly\">Rootly<\/a>:<\/h2>\n<p>Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo:<br \/>\n<a href=\"https:\/\/rootly.io\/?utm_source=sreweekly\">https:\/\/rootly.com\/?utm_source=sreweekly<\/a><\/p>\n<\/div>\n<h2>Articles<\/h2>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.forbes.com\/sites\/forbestechcouncil\/2021\/09\/20\/five-steps-to-reduce-sre-toil-and-add-more-value\/\">Five Steps To Reduce SRE Toil And Add More Value<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>The steps are:<\/p>\n<p>Know How Much Time Is Spent On Toil<br \/>\nFind The Toil<br \/>\nDetermine The Root Causes Of Toil<br \/>\nFind And Prioritize The Low-Hanging Fruit<br \/>\nPromote Toil Reduction<\/p>\n<p>Aater Suleman \u2014 Forbes<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/grafana.com\/blog\/2021\/10\/13\/how-were-building-a-production-readiness-review-process-at-grafana-labs\/\">How we\u2019re building a production readiness review process at Grafana Labs<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>I like how they try to strike a balance and avoid reviewing too far in depth, while still hitting everything important.<\/p>\n<p>Milan Pl\u017e\u00edk \u2014 Grafana Labs<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.opslevel.com\/blog\/opslevel-convos-seth-lochen-groupon\/\">Seth Lochen of Groupon talks ownership and the bystander effect, platform engineering, and frogs in boiling water<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Lots of good stuff in this one about one of my favorite topics, service ownership.<\/p>\n<p>Kenneth Rose \u2014 OpsLevel<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/ably.com\/blog\/crdts-distributed-data-consistency-challenges\">How do CRDTs solve distributed data consistency challenges?<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This is the intro I needed to understand Conflict-Free Replicated Data Types.<\/p>\n<p>Jo Stichbury \u2014 Ably<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/devops.com\/defining-availability-maintainability-and-reliability-in-sre\/\">Defining Availability, Maintainability and Reliability in SRE<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Availability, maintainability and reliability all have distinct\u2014if related\u2014meanings, and they each play different roles in reliability operations.<\/p>\n<p>JJ Tang \u2014 DevOps.com<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/cloudpundit.com\/2021\/10\/28\/five-p-factors-for-root-cause-analysis\/\">Five-P factors for root cause analysis<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>The five Ps come from medicine and understanding medical accidents, but they apply equally well to analyzing incidents in IT.<\/p>\n<p>Lydia Leong<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/newsletter.pragmaticengineer.com\/p\/incident-review-best-practices\">Incident Review and Postmortem Best Practices<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>I really love the focus on de-emphasizing finding action items in incident retrospectives, in favor of learning.<\/p>\n<p>Gergely Orosz \u2014 The Pragmatic Engineer<\/p>\n<\/div>\n<\/div>\n<h2>Outages<\/h2>\n<p><a href=\"https:\/\/status.twilio.com\/incidents\/wdrlk4qps0z1\">AT&amp;T SMS in the US<\/a><\/p>\n<p>This week, I saw several status pages point to some kind of problem in their ability to send SMS notifications to AT&amp;T phones. I thought this was interesting because usually I don\u2019t learn about an outage solely from <em>other<\/em> companies\u2019 status pages.<\/p>\n<p><a href=\"https:\/\/www.google.com\/appsstatus\/dashboard\/incidents\/k71P8nHp32hgcMSsC3mR\">Google Meet<\/a><br \/>\n<a href=\"https:\/\/www.forbes.com\/sites\/barrycollins\/2021\/10\/24\/tesco-website-suffers-day-long-outage\/\">Tesco<\/a><br \/>\n<a href=\"https:\/\/www.theepochtimes.com\/coinbase-faces-extended-outages-users-unable-to-trade-shiba-inu_4075487.html\">Coinbase<\/a><br \/>\n<a href=\"https:\/\/www.india.com\/news\/india\/breaking-zomato-server-down-due-to-outage-customers-face-issue-while-ordering-food-5075873\/\">Zomato<\/a><br \/>\n<a href=\"https:\/\/www.inentertainment.co.uk\/barclays-online-banking-system-and-app-go-down\/\">Barclays<\/a><br \/>\n<a href=\"https:\/\/www.cityam.com\/breaking-hsbc-online-banking-down-leaving-customers-locked-out-of-accounts\/\">HSBC<\/a><br \/>\nSRE WEEKLY<\/p>","protected":false},"excerpt":{"rendered":"<p>View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo: https:\/\/rootly.com\/?utm_source=sreweekly Articles Five Steps To Reduce SRE Toil And Add More Value The&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2021\/11\/01\/sre-weekly-issue-294\/\">Continue reading <span class=\"screen-reader-text\">SRE Weekly Issue #294<\/span><\/a><\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[8],"tags":[],"class_list":["post-498","post","type-post","status-publish","format-standard","hentry","category-sre","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":343,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-282\/","url_meta":{"origin":498,"position":0},"title":"SRE Weekly Issue #282","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: ICYMI ZAP Creator and Project Lead Simon Bennetts recently unveiled ZAP\u2019s new automation framework. Watch the session and see how it works: https:\/\/sthwk.com\/Automation-Framework Articles A thorough introduction to bpftrace I really need to learn bpftrace, and this article is a great\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":855,"url":"https:\/\/fde.cat\/index.php\/2024\/04\/15\/sre-weekly-issue-420\/","url_meta":{"origin":498,"position":1},"title":"SRE Weekly Issue #420","date":"April 15, 2024","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, FireHydrant: FireHydrant is now AI-powered for faster, smarter incidents! Power up your incidents with auto-generated real-time summaries, retrospectives, and status page updates. https:\/\/firehydrant.com\/blog\/ai-for-incident-management-is-here\/ 1.0 Launch Retrospective The game Last Epoch launched in February, and they had a rocky start. This huge retrospective\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":509,"url":"https:\/\/fde.cat\/index.php\/2021\/11\/29\/sre-weekly-issue-298\/","url_meta":{"origin":498,"position":2},"title":"SRE Weekly Issue #298","date":"November 29, 2021","format":false,"excerpt":"View on sreweekly.com Email subscribers, my apologies for the double-send last week. I upgraded WordPress and subsequently further cemented my distrust of all version upgrades ever. I carefully tested a fix in staging before rolling it out gradually in preparation for this week\u2019s issue. Just kidding, I hacked on it\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":543,"url":"https:\/\/fde.cat\/index.php\/2022\/02\/21\/sre-weekly-issue-310\/","url_meta":{"origin":498,"position":3},"title":"SRE Weekly Issue #310","date":"February 21, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":579,"url":"https:\/\/fde.cat\/index.php\/2022\/05\/30\/sre-weekly-issue-324\/","url_meta":{"origin":498,"position":4},"title":"SRE Weekly Issue #324","date":"May 30, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly lego set): https:\/\/rootly.com\/demo\/\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":540,"url":"https:\/\/fde.cat\/index.php\/2022\/02\/07\/sre-weekly-issue-308\/","url_meta":{"origin":498,"position":5},"title":"SRE Weekly Issue #308","date":"February 7, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/498","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=498"}],"version-history":[{"count":0,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/498\/revisions"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=498"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=498"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=498"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}