{"id":304,"date":"2021-08-31T14:40:03","date_gmt":"2021-08-31T14:40:03","guid":{"rendered":"https:\/\/fde.cat\/?p=304"},"modified":"2021-08-31T14:40:03","modified_gmt":"2021-08-31T14:40:03","slug":"sre-weekly-issue-269","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-269\/","title":{"rendered":"SRE Weekly Issue #269"},"content":{"rendered":"<p><a href=\"http:\/\/sreweekly.com\/sre-weekly-issue-269\/\" title=\"Permalink to SRE Weekly Issue #269\" class=\"email_only\">View on sreweekly.com<\/a><\/p>\n<div class=\"sreweekly-sponsor-message\">\n<h2>A message from our sponsor, StackHawk:<\/h2>\n<p>Tune into ZAPCon After Hours this Tuesday at 8 am PT to learn how to include automated security testing in your builds with ZAP<br \/>\n<a href=\"http:\/\/sthwk.com\/after-hours-3\">http:\/\/sthwk.com\/after-hours-3<\/a><\/p>\n<\/div>\n<h2>Articles<\/h2>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/netflixtechblog.com\/edgar-solving-mysteries-faster-with-observability-e1a76302c71f\">Edgar: Solving Mysteries Faster with Observability<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>We built Edgar to ease this burden, by empowering our users to troubleshoot distributed systems efficiently with the help of a summarized presentation of request tracing, logs, analysis, and metadata.<\/p>\n<p>Kevin Lew, Maulik Pandey, Narayanan Arunachalam, Dustin Haffner, Andrei Ushakov, Seth Katz, Greg Burrell, Ram Vaithilingam, Mike Smith and Elizabeth Carretto \u2014 Netflix<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/victorops.com\/blog\/the-comprehensive-site-reliability-engineering-sre-pdf\">The Comprehensive Site Reliability Engineering (SRE) PDF<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>The PDF covers 5 main areas:<\/p>\n<p>Availability<br \/>\nPerformance<br \/>\nMonitoring<br \/>\nIncident Response<br \/>\nPreparation<\/p>\n<p>No account required or form to fill out to download the PDF.<\/p>\n<p>Splunk\/VictorOps<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.blameless.com\/blog\/what-are-mttx-metrics-good-for\">What are MTTx Metrics Good For? Let\u2019s Find Out.<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>This one\u2019s especially interesting for the section about what MTTx metrics <em>aren\u2019t<\/em> good for, and the following section on how to improve them.<\/p>\n<p>Emily Arnott \u2014 Blameless<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/tech.ebayinc.com\/engineering\/resiliency-and-disaster-recovery-with-kafka\/\">Resiliency and Disaster Recovery with Kafka<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>If you\u2019re interested in deploying Kafka in a multi-region configuration, eBay has put quite a bit of thought into this and has a lot to share.<\/p>\n<p> Engin Yoeyen \u2014 eBay<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.verica.io\/blog\/what-chaos-engineering-is-and-isnt\/\">What Chaos Engineering Is (and Isn\u2019t)<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>Straight from someone who was there from the start. The \u201cwhat chaos engineering is not\u201d section is especially enlightening.<\/p>\n<p>Casey Rosenthal \u2014 Verica<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/status.heroku.com\/incidents\/2226\">Heroku incident #2226 follow-up: Private Space apps experiencing domain to SSL cert mapping errors<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>The last paragraph regarding \u201cunknown unknowns\u201d is noteworthy.<\/p>\n<p>Heroku<\/p>\n<\/div>\n<\/div>\n<div class=\"sreweekly-entry\">\n<div class=\"sreweekly-title\"><a href=\"https:\/\/www.gremlin.com\/\/blog\/failover-conf-follow-up-your-team-and-culture-questions-answered\/\">Failover Conf follow-up: Your team and culture questions answered!<\/a><\/div>\n<div class=\"sreweekly-description\">\n<p>There are some great questions in here on blamelessness and full service ownership.<\/p>\n<p>James Thigpen \u2014 Gremlin<\/p>\n<\/div>\n<\/div>\n<h2>Outages<\/h2>\n<p><a href=\"https:\/\/status.cloud.google.com\/incidents\/eCPQKkKcFy6NYXExnPXL\">Google Cloud Platform us-west2 region<\/a><\/p>\n<p>They posted a detailed follow-up at the above link.<\/p>\n<p><a href=\"https:\/\/www.dailymail.co.uk\/sciencetech\/article-9540515\/TikTok-appears-ban-accounts-overnight-outage.html\">TikTok<\/a><br \/>\n<a href=\"https:\/\/www.bleepingcomputer.com\/news\/technology\/network-solutions-and-registercom-hit-by-ongoing-dns-outage\/\">Network Solutions and Register.com<\/a><br \/>\n<a href=\"https:\/\/www.finews.asia\/finance\/34415-sgx-restores-services-after-outage-singapore-exchange\">Singapore Exchange (SGX)<\/a><br \/>\n<a href=\"https:\/\/reddit.statuspage.io\/incidents\/n65369gfkvk0\">reddit<\/a><br \/>\n<a href=\"https:\/\/twitter.com\/parler_app\/status\/1389870151726678023\">Parler<\/a><br \/>\nSRE WEEKLY<\/p>\n","protected":false},"excerpt":{"rendered":"<p>View on sreweekly.com A message from our sponsor, StackHawk: Tune into ZAPCon After Hours this Tuesday at 8 am PT to learn how to include automated security testing in your builds with ZAP http:\/\/sthwk.com\/after-hours-3 Articles Edgar: Solving Mysteries Faster with Observability We built Edgar to ease this burden, by empowering our users to troubleshoot distributed&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-269\/\">Continue reading <span class=\"screen-reader-text\">SRE Weekly Issue #269<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[8],"tags":[],"class_list":["post-304","post","type-post","status-publish","format-standard","hentry","category-sre","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":602,"url":"https:\/\/fde.cat\/index.php\/2022\/06\/27\/sre-weekly-issue-328\/","url_meta":{"origin":304,"position":0},"title":"SRE Weekly Issue #328","date":"June 27, 2022","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly lego set): https:\/\/rootly.com\/demo\/\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":298,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-266\/","url_meta":{"origin":304,"position":1},"title":"SRE Weekly Issue #266","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Are you a ZAP user looking to automate your security testing? Make sure to tune in to ZAPCon After Hours on Tuesday at 8 am PT to see how you can use Jenkins and Zest scripts to automate ZAP. http:\/\/sthwk.com\/zapcon-ah Articles\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":511,"url":"https:\/\/fde.cat\/index.php\/2021\/12\/06\/sre-weekly-issue-299\/","url_meta":{"origin":304,"position":2},"title":"SRE Weekly Issue #299","date":"December 6, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo:https:\/\/rootly.com\/?utm_source=sreweekly Articles More More More! Why the Most Resilient\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":487,"url":"https:\/\/fde.cat\/index.php\/2021\/10\/11\/sre-weekly-issue-291\/","url_meta":{"origin":304,"position":3},"title":"SRE Weekly Issue #291","date":"October 11, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly \ud83d\ude92. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo: https:\/\/rootly.io\/?utm_source=sreweekly Articles Understanding How Facebook Disappeared from the\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":318,"url":"https:\/\/fde.cat\/index.php\/2021\/08\/31\/sre-weekly-issue-274\/","url_meta":{"origin":304,"position":4},"title":"SRE Weekly Issue #274","date":"August 31, 2021","format":false,"excerpt":"View on sreweekly.com A message from our sponsor, StackHawk: Join the GraphQL Security Testing Learning Lab on June 29 at 9 AM PT. Learn how to run automated security testing against your GraphQL APIs so you can find and fix vulnerabilities fast. http:\/\/sthwk.com\/graphql-learning-lab Articles Chicken Soup for the SLO The\u2026","rel":"","context":"In &quot;SRE&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":525,"url":"https:\/\/fde.cat\/index.php\/2021\/12\/28\/sre-netflix-at-srecon\/","url_meta":{"origin":304,"position":5},"title":"SRE Netflix at SRECon","date":"December 28, 2021","format":"video","excerpt":"190 Countries and 5 CORE SREs by Jonah Horowitz How does Netflix scale SRE? How do we manage over 70 million customers around the world without a 24\/7 operations center? With tens of thousands of Linux instances in a distributed system architecture, and thousands of daily production changes, it's an\u2026","rel":"","context":"In &quot;External&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/304","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=304"}],"version-history":[{"count":1,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/304\/revisions"}],"predecessor-version":[{"id":406,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/304\/revisions\/406"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=304"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=304"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=304"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}