SRE Weekly Issue #318

View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly đźš’. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly lego set): https://rootly.com/demo/ Articles Errors are constructed, not… Continue reading SRE Weekly Issue #318

Published
Categorized as SRE

Onboarding SLOs for Salesforce services

At Salesforce, we operate thousands of services of various sizes: monolith and micro-services, both customer-facing and internal, across multiple substrates, i.e. first party and public cloud infrastructure. In our earlier blog “READS: Service Health Metrics,” we talked about the Service Level Objective (SLO) framework called READS that we developed Salesforce to standardize SLO tracking for… Continue reading Onboarding SLOs for Salesforce services

Published
Categorized as Technology

Onboarding SLOs for Salesforce services

At Salesforce, we operate thousands of services of various sizes: monolith and micro-services, both customer-facing and internal, across multiple substrates, i.e. first party and public cloud infrastructure. In our earlier blog “READS: Service Health Metrics,” we talked about the Service Level Objective (SLO) framework called READS that we developed at Salesforce to standardize SLO tracking… Continue reading Onboarding SLOs for Salesforce services

Published
Categorized as Technology

SRE Weekly Issue #317

View on sreweekly.com Bit of a short issue this week, as I’m currently recovering from COVID-19. Please don’t worry! I seem to have a very minor case, likely thanks in large part to vaccination and masking. I mostly just feel tired. A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly đźš’.… Continue reading SRE Weekly Issue #317

Published
Categorized as SRE

Transforming Service Reliability Through an SLOs-Driven Culture & Platform

At Salesforce, Trust is our number-one value, and it has its own special meaning to each part of the company. In our Technology, Marketing, & Products (TMP) organization, a big part of Trust is providing highly reliable Salesforce experiences to our customers, which can be challenging because of the scale of the Salesforce infrastructure, its… Continue reading Transforming Service Reliability Through an SLOs-Driven Culture & Platform

Published
Categorized as Technology

Transforming Service Reliability Through an SLOs-Driven Culture & Platform

At Salesforce, Trust is our number-one value, and it has its own special meaning to each part of the company. In our Technology, Marketing, & Products (TMP) organization, a big part of Trust is providing highly reliable Salesforce experiences to our customers, which can be challenging because of the scale of the Salesforce infrastructure, its… Continue reading Transforming Service Reliability Through an SLOs-Driven Culture & Platform

Published
Categorized as Technology

SRE Weekly Issue #316

View on sreweekly.com I’m on vacation, so I prepared this issue in advance. Practically speaking, that just means there’s no Outages section this week. See you all next week! P.S. Okay, I know I said no outages, but I will say that I’m keeping an eye on the Southwest Airlines outage, because we’re kind of… Continue reading SRE Weekly Issue #316

Published
Categorized as SRE

How Meta enables de-identified authentication at scale

Data minimization — collecting the minimum amount of data required to support our services — is one of our core principles at Meta as we continue developing new privacy-enhancing technologies (PETs). We are constantly seeking ways to improve privacy and protect user data on our family of products. Previously, we’ve approached data minimization by exploring… Continue reading How Meta enables de-identified authentication at scale

Published
Categorized as Technology

Investigate Issues with Ease by Adding a Correlation ID to your API

With APIs becoming more complex and distributed, developers sometimes struggle to find the relevant logs when they need to investigate a specific issue. In the new Salesforce Commerce APIs (SCAPI), we created such an architecture of distributed systems and recognized this problem early. Our approach to mitigate it was the introduction of a correlation ID. This ID… Continue reading Investigate Issues with Ease by Adding a Correlation ID to your API

Published
Categorized as Technology

Investigate Issues with Ease by Adding a Correlation ID to your API

With APIs becoming more complex and distributed, developers sometimes struggle to find the relevant logs when they need to investigate a specific issue. In the new Salesforce Commerce APIs (SCAPI), we created such an architecture of distributed systems and recognized this problem early. Our approach to mitigate it was the introduction of a correlation ID.… Continue reading Investigate Issues with Ease by Adding a Correlation ID to your API

Published
Categorized as Technology