{"id":254,"date":"2021-08-31T14:40:46","date_gmt":"2021-08-31T14:40:46","guid":{"rendered":"https:\/\/fde.cat\/?p=254"},"modified":"2021-08-31T14:40:46","modified_gmt":"2021-08-31T14:40:46","slug":"sre-weekly-issue-253","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-253\/","title":{"rendered":"SRE Weekly Issue #253"},"content":{"rendered":"<p><a href=\"http:\/\/sreweekly.com\/sre-weekly-issue-253\/\" title=\"Permalink to SRE Weekly Issue #253\" class=\"email_only\">View on sreweekly.com<\/a><\/p>\n<div class=\"sreweekly-sponsor-message\">\n<h2>A message from our sponsor, StackHawk:<\/h2>\n<p>How do you know if your GraphQL API is secure? Watch StackHawk CSO Scott Gerlach walk through how to run application security tests for GraphQL-backed apps.<br \/>\n<a href=\"http:\/\/sthwk.com\/graphql-webinar\">http:\/\/sthwk.com\/graphql-webinar<\/a><\/p>\n<\/div>\n<h2>Articles<\/h2>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.algolia.com\/blog\/may-30-ssl-incident\/\" target=\"_blank\" rel=\"noopener\">May 30 SSL incident<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>TLS can be such a headache.<\/p>\n<blockquote>\n<p>This was an interesting situation. There was a valid path to the USERTrust RSA Certification Authority, and there was also an expired path. The browser was able to find the valid chain, but the curl was not able to find it.<\/p>\n<\/blockquote>\n<p><small>Adam Surak \u2014 Algolia<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.infoq.com\/articles\/series-enhancing-resilience-2\/\" target=\"_blank\" rel=\"noopener\">Shifting Modes: Creating a Program to Support Sustained Resilience <\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>A well-researched article on shifting emphasis from incident prevention to learning and resilience.<\/p>\n<blockquote>\n<p>Incidents cannot be prevented, because incidents are the inevitable result of success.<\/p>\n<\/blockquote>\n<p><small>Alex Elman<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/surfingcomplexity.blog\/2021\/01\/10\/error-budgets-and-the-legacy-of-herbert-heinrich\/\" target=\"_blank\" rel=\"noopener\">Error budgets and the legacy of Herbert Heinrich<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This one\u2019s worth reading through twice to let it sink in. It puts me in mind of <a href=\"https:\/\/willgallego.com\/2019\/02\/23\/resilience-engineering-and-error-budgets\/\">this article by WIll Gallego<\/a>, which is another thoughtful critique of error budgets.<\/p>\n<blockquote>\n<p>Here are the claims I\u2019m going to make:<\/p>\n<ol>\n<li>Large incidents are much more costly to organizations than small ones, so we should work to reduce the risk of large incidents.<\/li>\n<li>Error budgets don\u2019t help reduce risk of large incidents.<\/li>\n<\/ol>\n<\/blockquote>\n<p><small>Lorin Hochstein<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.unixdaemon.net\/sysadmin\/97-things-every-sre-01\/\" target=\"_blank\" rel=\"noopener\">97 things every SRE should know \u2013 Part 01<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This is a review of a few of the chapters of the book of the same title by Emil Stolarsky and Jaime Woo.<\/p>\n<p>Have you read it too? I\u2019d love to read your take on it!<\/p>\n<p><small>Dean Wilson<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.adaptivecapacitylabs.com\/blog\/2021\/01\/11\/understanding-incidents-three-analytical-traps\/\" target=\"_blank\" rel=\"noopener\">Understanding Incidents: Three Analytical Traps<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This one\u2019s worth reading the next time need to do an incident retrospective. The traps are:<\/p>\n<blockquote>\n<ol>\n<li>Counterfactual reasoning<\/li>\n<li>Normative language<\/li>\n<li>Mechanistic reasoning<\/li>\n<\/ol>\n<\/blockquote>\n<p><small>John Allspaw \u2014 Adaptive Capacity Labs<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.blameless.com\/blog\/the-most-underappreciated-skill-for-sres\" target=\"_blank\" rel=\"noopener\">This Is the Most Underappreciated Skill for SREs<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>The skill in question is <em>glue work<\/em>, and I sure appreciate a good gluer when I see one.<\/p>\n<p><small>Emily Arnott \u2014 Blameless<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.pagerduty.com\/blog\/building-scaling-sre-team\/\" target=\"_blank\" rel=\"noopener\">Building and Scaling Your SRE Team<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This one starts out by defining SRE, then goes into how to define your team and fill it with people.<\/p>\n<p><small>Julie Gunderson \u2014 PagerDuty<\/small><\/p>\n<\/div>\n<\/div>\n<h2>Outages<\/h2>\n<ul class=\"sreweekly-outages\">\n<li><a href=\"https:\/\/status.fastly.com\/incidents\/1rbr326kjsmt\">Fastly<\/a>\n<ul>\n<li><em><small>Fastly is my employer.<\/small><\/em><\/li>\n<\/ul>\n<\/li>\n<li><a href=\"https:\/\/status.slack.com\/\/2021-01\/e8d2a136e0850d16\">Slack<\/a><\/li>\n<li><a href=\"https:\/\/themarketherald.com.au\/tyro-asxtyr-updates-market-on-week-long-outage-2021-01-13\/\">Tyro Payments<\/a><\/li>\n<li><a href=\"https:\/\/www.theverge.com\/2021\/1\/15\/22232993\/signal-outage-new-users-messages-not-sending\">Signal<\/a><\/li>\n<li><a href=\"https:\/\/gadgets-africa.com\/2021\/01\/11\/kenic-ke-sites-go-down\/\">.ke TLD (Kenya)<\/a><\/li>\n<li><a href=\"https:\/\/www.express.co.uk\/life-style\/science-technology\/1383089\/Microsoft-Teams-Office-365-OneDrive-SharePoint-outage-status-latest\">Microsoft Teams, Office 365 and OneDrive<\/a><\/li>\n<li><a href=\"https:\/\/www.dailymail.co.uk\/sciencetech\/article-9144877\/Instagram-Worldwide-outage-leaves-thousands-unable-access-desktop-site-app.html\">Instagram<\/a><\/li>\n<\/ul>\n<p>SRE WEEKLY<\/p>\n","protected":false},"excerpt":{"rendered":"<p>View on sreweekly.com A message from our sponsor, StackHawk: How do you know if your GraphQL API is secure? Watch StackHawk CSO Scott Gerlach walk through how to run application security tests for GraphQL-backed apps. http:\/\/sthwk.com\/graphql-webinar Articles May 30 SSL incident TLS can be such a headache. This was an interesting situation. There was a&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-253\/\">Continue reading <span class=\"screen-reader-text\">SRE Weekly Issue #253<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[8],"tags":[],"class_list":["post-254","post","type-post","status-publish","format-standard","hentry","category-sre","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":318,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-274\/","url_meta":{"origin":254,"position":0},"title":"SRE Weekly Issue #274","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Join the GraphQL Security Testing Learning Lab on June 29 at 9 AM PT. Learn how to run automated security testing against your GraphQL APIs so you can find and fix vulnerabilities fast. http:\/\/sthwk.com\/graphql-learning-lab Articles Chicken Soup for the SLO The\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":324,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-276\/","url_meta":{"origin":254,"position":1},"title":"SRE Weekly Issue #276","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Get ready for some GraphQL! Tune in this Tuesday, June 29 at 9 AM PT for an automated GraphQL security testing learning lab. Register: http:\/\/sthwk.com\/graphql-learning-lab Articles @GergelyOrosz on blaming the intern HBO accidentally sent an email to a bunch of people,\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":269,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-257\/","url_meta":{"origin":254,"position":2},"title":"SRE Weekly Issue #257","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Keeping your APIs secure requires thoughtful design and testing. Learn how to protect your REST, SOAP and GraphQL APIs from security vulnerabilities with StackHawk http:\/\/sthwk.com\/api-protection Articles Sometimes alerts have inobvious reasons for existing This one really got me thinking. Make sure\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":746,"url":"https:\/\/fde.cat\/index.php\/2023\/08\/14\/sre-weekly-issue-385\/","url_meta":{"origin":254,"position":3},"title":"SRE Weekly Issue #385","date":"August 14, 2023","format":false,"excerpt":"View on sreweekly.com Many apologies to Matt Cooper at GitHub, who is the actual author of the article Scaling Merge-ort Across GitHub from last week. Sorry for the mis-credit, Matt! A message from our sponsor, Rootly: When incidents impact your customers, failing to communicate with them effectively can erode trust\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":499,"url":"https:\/\/fde.cat\/index.php\/2021\/11\/01\/the-journey-of-building-a-scalable-api\/","url_meta":{"origin":254,"position":4},"title":"The Journey of Building a Scalable API","date":"November 1, 2021","format":false,"excerpt":"APIs are an essential tool to allow partners, developers, and applications to consume, communicate\u00a0,or build on top of the various capabilities your microservices provide. Building a high quality API that can scale and perform with the business ecosystem is not easy and requires putting thought and planning into everything, from\u2026","rel":"","context":"In &quot;Technology&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":255,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-252\/","url_meta":{"origin":254,"position":5},"title":"SRE Weekly Issue #252","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Interested in how you can automate application security testing with GitHub Actions? Check out this on demand webinar from StackHawk and Snyk and see how simple it is to get started. https:\/\/sthwk.com\/stackhawk-snyk Articles Building On-Call Culture at GitHub Their on-call started\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/254","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=254"}],"version-history":[{"count":1,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/254\/revisions"}],"predecessor-version":[{"id":449,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/254\/revisions\/449"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=254"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=254"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=254"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}