{"id":261,"date":"2021-08-31T14:40:46","date_gmt":"2021-08-31T14:40:46","guid":{"rendered":"https:\/\/fde.cat\/?p=261"},"modified":"2021-08-31T14:40:46","modified_gmt":"2021-08-31T14:40:46","slug":"sre-weekly-issue-256","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-256\/","title":{"rendered":"SRE Weekly Issue #256"},"content":{"rendered":"<p><a href=\"http:\/\/sreweekly.com\/sre-weekly-issue-256\/\" title=\"Permalink to SRE Weekly Issue #256\" class=\"email_only\">View on sreweekly.com<\/a><\/p>\n<div class=\"sreweekly-sponsor-message\">\n<h2>A message from our sponsor, StackHawk:<\/h2>\n<p>Register now for the first-ever ZAPCon taking place March 9th. The free event will focus on OWASP ZAP and application security best practices. You won\u2019t want to miss it!<br \/>\n<a href=\"http:\/\/sthwk.com\/zapcon-sre-weekly\">http:\/\/sthwk.com\/zapcon-sre-weekly<\/a><\/p>\n<\/div>\n<h2>Articles<\/h2>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/slack.engineering\/slacks-outage-on-january-4th-2021\/\" target=\"_blank\" rel=\"noopener\">Slack\u2019s Outage on January 4th 2021<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Here\u2019s a blog post from Slack giving even more information about what went wrong on January 4. 
Bravo, Slack, there\u2019s a lot in here for us to learn from.<\/p>\n<p><small>Laura Nolan \u2014 Slack<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/research.fb.com\/publications\/zero-downtime-release-disruption-free-load-balancing-of-a-multi-billion-user-website\/\" target=\"_blank\" rel=\"noopener\">Zero Downtime Release: Disruption-free Load Balancing of a Multi-Billion User Website<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This academic paper from Facebook explains how they release code without disrupting active connections, even for a small number of users.<\/p>\n<p><small>Usama Naseer, Luca Niccolini, Udip Pant, Alan Frindell, Ranjeeth Dasineni, and Theophilus A. Benson \u2014 Facebook<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/billduncan.org\/notam-for-sres\/\" target=\"_blank\" rel=\"noopener\">NOTAM for SREs<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Another lesson we can learn from aviation: have one place where engineers can find out about temporary infrastructure changes that are important.<\/p>\n<p><small>Bill Duncan<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/blog.coinbase.com\/incident-post-mortem-january-29-2021-5ab5247e43da\" target=\"_blank\" rel=\"noopener\">Incident Post Mortem: January 29, 2021 [Coinbase]<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Coinbase posted this detailed analysis of their January 29th incident.<\/p>\n<p><small>Coinbase<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.forbes.com\/sites\/forbestechcouncil\/2021\/02\/04\/how-cloud-services-platform-teams-can-drive-the-adoption-of-effective-sre-practices\/\" target=\"_blank\" rel=\"noopener\">Council Post: How Cloud Services Platform Teams Can Drive The 
Adoption Of Effective SRE Practices<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Interesting thesis: a company moving into the cloud is in a unique position to adopt SRE practices \u2014 and better situated than cloud-first companies.<\/p>\n<p><small>Tina Huang (CTO, Transposit) \u2014 Forbes<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.blameless.com\/blog\/im-just-doing-my-job-an-sre-myth\" target=\"_blank\" rel=\"noopener\">\u201cI\u2019m Just Doing my Job,\u201d An SRE Myth<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>We need to push past surface-level mitigation of an incident and really dig in and learn.<\/p>\n<p><small>Darrell Pappa \u2014 Blameless<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/github.blog\/2021-02-02-github-availability-report-january-2021\/\" target=\"_blank\" rel=\"noopener\">GitHub Availability Report: January 2021<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>GitHub\u2019s database failed in a manner that wasn\u2019t detected by their automated failover system.<\/p>\n<p><small>Keith Ballinger \u2014 GitHub<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/engineering.linkedin.com\/blog\/2021\/open-source-update--school-of-sre\" target=\"_blank\" rel=\"noopener\">Open source update: School of SRE<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>LinkedIn published their SRE training documentation in the form of a full curriculum covering a range of topics.<\/p>\n<p><small>Akbar KM and Kalyanasundaram Somasundaram \u2014 LinkedIn<\/small><\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/rachelbythebay.com\/w\/2021\/02\/03\/bits\/\" target=\"_blank\" rel=\"noopener\">Push some big numbers through your system and look for 
bugs<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Your code may be designed to handle 64-bit integers, but what if a library (such as a JSON decoder) converts them to floating-point numbers?<\/p>\n<p><small>rachelbythebay<\/small><\/p>\n<\/div>\n<\/div>\n<h2>Outages<\/h2>\n<ul class=\"sreweekly-outages\">\n<li><a href=\"https:\/\/finance.yahoo.com\/news\/apple-icloud-outage-201939429.html\">Apple iCloud<\/a><\/li>\n<li><a href=\"https:\/\/www.bleepingcomputer.com\/news\/security\/spamcop-anti-spam-service-suffers-an-outage-after-its-domain-expired\/\">SpamCop<\/a><\/li>\n<li><a href=\"https:\/\/www.androidpolice.com\/2021\/02\/01\/wink-has-been-down-for-a-week-and-counting\/\">Wink<\/a><\/li>\n<li><a href=\"https:\/\/www.polygon.com\/streaming\/2021\/2\/2\/22219391\/twitch-down-feb-2-2021-investigating-problems\">Twitch<\/a><\/li>\n<li><a href=\"https:\/\/www.techtimes.com\/articles\/256753\/20210205\/disney-down-error-code-83-401-server-outage-persists-viral.htm\">Disney+<\/a><\/li>\n<\/ul>\n<p>SRE WEEKLY<\/p>\n","protected":false},"excerpt":{"rendered":"<p>View on sreweekly.com A message from our sponsor, StackHawk: Register now for the first-ever ZAPCon taking place March 9th. The free event will focus on OWASP ZAP and application security best practices. You won\u2019t want to miss it! 
http:\/\/sthwk.com\/zapcon-sre-weekly Articles Slack\u2019s Outage on January 4th 2021 Here\u2019s a blog post from Slack giving even more&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-256\/\">Continue reading <span class=\"screen-reader-text\">SRE Weekly Issue #256<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[8],"tags":[],"class_list":["post-261","post","type-post","status-publish","format-standard","hentry","category-sre","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":540,"url":"https:\/\/fde.cat\/index.php\/2022\/02\/07\/sre-weekly-issue-308\/","url_meta":{"origin":261,"position":0},"title":"SRE Weekly Issue #308","date":"February 7, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https:\/\/rootly.com\/demo\/?utm_source=sreweekly Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":681,"url":"https:\/\/fde.cat\/index.php\/2023\/02\/20\/sre-weekly-issue-360\/","url_meta":{"origin":261,"position":1},"title":"SRE Weekly Issue #360","date":"February 20, 2023","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly\u00a0\ud83d\ude92. Rootly automates manual tasks like creating an incident channel, Jira ticket and Zoom rooms, inviting responders, creating statuspage updates, postmortem timelines and more. 
Want to see why companies like Canva and Grammarly love us?:\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":253,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-254\/","url_meta":{"origin":261,"position":2},"title":"SRE Weekly Issue #254","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Need to run a standalone Kotlin app as a fat jar in a Gradle project? Check out how we handled that! http:\/\/sthwk.com\/kotlin-with-gradle Articles Coinbase Incident Post Mortem: January 6\u20137, 2021 This one\u2019s juicy. At one point, the front-end was blocked up,\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":487,"url":"https:\/\/fde.cat\/index.php\/2021\/10\/11\/sre-weekly-issue-291\/","url_meta":{"origin":261,"position":3},"title":"SRE Weekly Issue #291","date":"October 11, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo: https:\/\/rootly.io\/?utm_source=sreweekly Articles Understanding How Facebook Disappeared from the\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":269,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-257\/","url_meta":{"origin":261,"position":4},"title":"SRE Weekly Issue #257","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Keeping your APIs secure requires thoughtful design and testing. 
Learn how to protect your REST, SOAP and GraphQL APIs from security vulnerabilities with StackHawk http:\/\/sthwk.com\/api-protection Articles Sometimes alerts have inobvious reasons for existing This one really got me thinking. Make sure\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":463,"url":"https:\/\/fde.cat\/index.php\/2021\/09\/20\/sre-weekly-issue-287\/","url_meta":{"origin":261,"position":5},"title":"SRE Weekly Issue #287","date":"September 20, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Trying to figure out how to keep your APIs secure? You\u2019re not the only one. See how DataRobot is automating API security testing with StackHawk. https:\/\/sthwk.com\/DataRobot Articles Industry Interviews: Colm Doyle, Incident Commander at Slack Lots of details about how Slack\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/261","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=261"}],"version-history":[{"count":1,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/261\/revisions"}],"predecessor-version":[{"id":445,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/261\/revisions\/445"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=261"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=261"},{"taxonomy":"post_tag","embeddable":true,"href":"
https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=261"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}