An Intro To Mob Programming

Mob programming is a collective programming discipline that fosters greater interaction with teammates, collective code ownership, and a sharp reduction in knowledge siloing. It can help a growing team to onboard new members and enables stable teams to produce quality work consistently without having to worry about scheduling. The discipline — sometimes referred to as ensembled programming — came… Continue reading An Intro To Mob Programming

Published
Categorized as Technology

FOQS: Making a distributed priority queue disaster-ready

Facebook Ordered Queueing Service (FOQS) is a fully managed, distributed priority queueing service used for reliable message delivery among many services. FOQS has evolved from a regional deployment into a geo-distributed, global deployment to ensure that data stored within logical queues is highly available, even through large-scale disaster scenarios. Migrating to a global architecture required… Continue reading FOQS: Making a distributed priority queue disaster-ready

Published
Categorized as Technology

SRE Weekly Issue #305

View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly 🚒. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https://rootly.com/demo/?utm_source=sreweekly Articles Avoiding Alert Fatigue: 8 Tips… Continue reading SRE Weekly Issue #305

Published
Categorized as SRE

5 Design Patterns for Building Observable Services

How can you make your services observable and embrace service ownership? This article presents a variety of universally applicable design patterns for the developer to consider. Design patterns in software development are repeatable solutions and best practices for solving commonly occurring problems. Even in the case of service monitoring, design patterns, when used appropriately, can… Continue reading 5 Design Patterns for Building Observable Services

Published
Categorized as Technology

SRE Weekly Issue #304

View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly 🚒. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https://rootly.com/demo/?utm_source=sreweekly Articles Channel global decoupling for region… Continue reading SRE Weekly Issue #304

Published
Categorized as SRE

Managing Availability in Service Based Deployments with Continuous Testing

The Problem At Salesforce, trust is our number one value. What this equates to is that our customers need to trust us; trust us to safeguard their data, trust that we will keep our services up and running, and trust that we will be there for them when they need us. In the world of Software… Continue reading Managing Availability in Service Based Deployments with Continuous Testing

Published
Categorized as Technology

SRE Weekly Issue #303

View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly 🚒. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo:https://rootly.com/demo/?utm_source=sreweekly Articles Hot Takes on Code Freezes There are way too many gorgeous, mind-blowing… Continue reading SRE Weekly Issue #303

Published
Categorized as SRE

SRE Netflix at SRECon

190 Countries and 5 CORE SREs by Jonah Horowitz How does Netflix scale SRE? How do we manage over 70 million customers around the world without a 24/7 operations center? With tens of thousands of Linux instances in a distributed system architecture, and thousands of daily production changes, it’s an environment that’s both challenging and… Continue reading SRE Netflix at SRECon

SRE Weekly Issue #302

View on sreweekly.com Happy holidays, for those that celebrate! I put this issue together in advance, so no Outages section this week. A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly 🚒. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up… Continue reading SRE Weekly Issue #302

Published
Categorized as SRE