View on sreweekly.com Many apologies to Matt Cooper at GitHub, who is the actual author of the article Scaling Merge-ort Across GitHub from last week. Sorry for the mis-credit, Matt! A message from our sponsor, Rootly: When incidents impact your customers, failing to communicate with them effectively can erode trust even further and compound an… Continue reading SRE Weekly Issue #385
Category: SRE
Posts related to Site Reliability Engineering
SRE Weekly Issue #384
View on sreweekly.com A message from our sponsor, Rootly: When incidents impact your customers, failing to communicate with them effectively can erode trust even further and compound an already difficult situation. Learn the essentials of customer-facing incident communication in Rootly’s latest blog post: https://rootly.com/blog/the-medium-is-the-message-how-to-master-the-most-essential-incident-communication-channels Articles Scaling merge-ort across GitHub They tested this new git merge… Continue reading SRE Weekly Issue #384
SRE Weekly Issue #383
View on sreweekly.com A message from our sponsor, Rootly: Eliminate the anxiety around declaring an incident for nebulous problems by introducing a triage phase into your incident management process. Our latest blog posts dives into why the triage phase is so important, and how you can automate yours with Rootly. Read more on the Rootly… Continue reading SRE Weekly Issue #383
SRE Weekly Issue #382
View on sreweekly.com A message from our sponsor, Rootly: Eliminate the anxiety around declaring an incident for nebulous problems by introducing a triage phase into your incident management process. Our latest blog posts dives into why the triage phase is so important, and how you can automate yours with Rootly. Read more on the Rootly… Continue reading SRE Weekly Issue #382
SRE Weekly Issue #381
View on sreweekly.com A message from our sponsor, Rootly: Curious how companies like Elastic, Tripadvisor, and 100s of others leverage Rootly to manage incidents in Slack and unlock instant best practices? Check out this lightning demo: https://www.loom.com/share/051c4be0425a436e888dc0c3690855ad Articles The Pyramid of Alerting The Pyramid introduced in this article is three levels of monitoring: Operational, Data… Continue reading SRE Weekly Issue #381
SRE Weekly Issue #380
View on sreweekly.com A message from our sponsor, Rootly: Curious how companies like Elastic, Tripadvisor, and 100s of others leverage Rootly to manage incidents in Slack and unlock instant best practices? Check out this lightning demo: https://www.loom.com/share/051c4be0425a436e888dc0c3690855ad Articles Amazon Prime Video’s Microservices Move Doesn’t Lead to a Monolith after All Well, that cleared things up.… Continue reading SRE Weekly Issue #380
SRE Weekly Issue #379
View on sreweekly.com A message from our sponsor, Rootly: Curious how companies like Figma, Tripadvisor, and 100s of others leverage Rootly to manage incidents in Slack and unlock instant best practices? Check out this lightning demo: https://www.loom.com/share/051c4be0425a436e888dc0c3690855ad Articles The Saga Is Antipattern In case you weren’t familiar with the Saga pattern like I was, it’s… Continue reading SRE Weekly Issue #379
SRE Weekly Issue #378
View on sreweekly.com A message from our sponsor, Rootly: Curious how companies like Figma, Tripadvisor, and 100s of others leverage Rootly to manage incidents in Slack and unlock instant best practices? Check out this lightning demo: https://www.loom.com/share/051c4be0425a436e888dc0c3690855ad Articles “One-Engined-Zulu” This is the story of a fascinating incident in which a commercial airplane’s engine was ripped… Continue reading SRE Weekly Issue #378
SRE Weekly Issue #377
View on sreweekly.com A message from our sponsor, Rootly: Curious how companies like Figma, Tripadvisor, and 100s of others leverage Rootly to manage incidents in Slack and unlock instant best practices? Check out this lightning demo: https://www.loom.com/share/051c4be0425a436e888dc0c3690855ad Articles Why did AWS Support fail with US-EAST-1 again? AWS had a major Lambda outage in us-east-1, and… Continue reading SRE Weekly Issue #377
SRE Weekly Issue #376
View on sreweekly.com A message from our sponsor, Rootly: Curious how companies like Figma, Tripadvisor, and 100s of others leverage Rootly to manage incidents in Slack and unlock instant best practices? Check out this lightning demo: https://www.loom.com/share/051c4be0425a436e888dc0c3690855ad Articles 2023 03 08 Incident: A Deep Dive into Our Incident Response With 100 workstreams and over 500… Continue reading SRE Weekly Issue #376