READS: Service Health Metrics

As you scale your company’s software footprint, with Service-Oriented Architecture (SOA) or microservices architecture featuring 100s or 1000s of services or more, how do you keep track of the performance of every service in every region? How will you know whether you are tracking every service? Of course every service is different, but how do… Continue reading READS: Service Health Metrics

Published
Categorized as Technology

Behind the Scenes of Hyperforce: Salesforce’s Infrastructure for the Public Cloud

Salesforce has been running cloud infrastructure for over two decades, bringing companies and their customers together. When Salesforce first started out in 1999, the world was very different; back then, the only practical way to provide our brand of Software-As-A-Service was to run everything yourself — not just the software, but the servers, storage, networking devices, cooling, etc.… Continue reading Behind the Scenes of Hyperforce: Salesforce’s Infrastructure for the Public Cloud

Published
Categorized as Technology

SRE Weekly Issue #307

View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly 🚒. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https://rootly.com/demo/?utm_source=sreweekly Articles Roblox Return to Service 10/28-10/31… Continue reading SRE Weekly Issue #307

Published
Categorized as SRE

Scaling cross-team contributions to a native mobile app

By Stephen Goldberg, Alex Sikora, and Jean Bovet Flagship applications are home to myriad functionalities that serve different parts of your userbase. Often, adding a new feature unintentionally causes reduced velocity, single points of failure, and monoliths that are hard to navigate. Such flagship apps are built from contributions from multiple teams each with varying degrees… Continue reading Scaling cross-team contributions to a native mobile app

Published
Categorized as Technology

SRE Weekly Issue #306

View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly 🚒. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https://rootly.com/demo/?utm_source=sreweekly Articles The James Webb Space Telescope — Success… Continue reading SRE Weekly Issue #306

Published
Categorized as SRE

An Intro To Mob Programming

Mob programming is a collective programming discipline that fosters greater interaction with teammates, collective code ownership, and a sharp reduction in knowledge siloing. It can help a growing team to onboard new members and enables stable teams to produce quality work consistently without having to worry about scheduling. The discipline — sometimes referred to as ensembled programming — came… Continue reading An Intro To Mob Programming

Published
Categorized as Technology

FOQS: Making a distributed priority queue disaster-ready

Facebook Ordered Queueing Service (FOQS) is a fully managed, distributed priority queueing service used for reliable message delivery among many services. FOQS has evolved from a regional deployment into a geo-distributed, global deployment to ensure that data stored within logical queues is highly available, even through large-scale disaster scenarios. Migrating to a global architecture required… Continue reading FOQS: Making a distributed priority queue disaster-ready

Published
Categorized as Technology

SRE Weekly Issue #305

View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly 🚒. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly shirt): https://rootly.com/demo/?utm_source=sreweekly Articles Avoiding Alert Fatigue: 8 Tips… Continue reading SRE Weekly Issue #305

Published
Categorized as SRE

5 Design Patterns for Building Observable Services

How can you make your services observable and embrace service ownership? This article presents a variety of universally applicable design patterns for the developer to consider. Design patterns in software development are repeatable solutions and best practices for solving commonly occurring problems. Even in the case of service monitoring, design patterns, when used appropriately, can… Continue reading 5 Design Patterns for Building Observable Services

Published
Categorized as Technology