Transforming Service Reliability Through an SLOs-Driven Culture & Platform

At Salesforce, Trust is our number-one value, and it has its own special meaning to each part of the company. In our Technology, Marketing, & Products (TMP) organization, a big part of Trust is providing highly reliable Salesforce experiences to our customers, which can be challenging because of the scale of the Salesforce infrastructure, its…

Published
Categorized as Technology

SRE Weekly Issue #316

I’m on vacation, so I prepared this issue in advance. Practically speaking, that just means there’s no Outages section this week. See you all next week! P.S. Okay, I know I said no outages, but I will say that I’m keeping an eye on the Southwest Airlines outage, because we’re kind of…

Published
Categorized as SRE

How Meta enables de-identified authentication at scale

Data minimization — collecting the minimum amount of data required to support our services — is one of our core principles at Meta as we continue developing new privacy-enhancing technologies (PETs). We are constantly seeking ways to improve privacy and protect user data on our family of products. Previously, we’ve approached data minimization by exploring…

Published
Categorized as Technology

Investigate Issues with Ease by Adding a Correlation ID to your API

With APIs becoming more complex and distributed, developers sometimes struggle to find the relevant logs when they need to investigate a specific issue. In the new Salesforce Commerce APIs (SCAPI), we created such an architecture of distributed systems and recognized this problem early. Our approach to mitigate it was the introduction of a correlation ID. This ID…

Published
Categorized as Technology

SRE Weekly Issue #315

I’m going on vacation, so I’m going to prepare next week’s issue in advance. It’ll look much like most issues, except there won’t be an Outages section. See you all in two weeks! A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly 🚒. Automate manual admin tasks like…

Published
Categorized as SRE

How This Journalist-Turned-Engineer Promotes Equality and Inclusion Through Digital Allyship

Crystal Preston-Watson is an Accessibility and Quality Engineer based in Denver, Colorado. She is the Senior Digital Accessibility Analyst at Salesforce. Crystal believes that accessibility is a civil and human right and is dedicated to making innovative, inclusive, and accessible applications for everyone. Crystal Preston-Watson’s driving principle as a Senior Digital Accessibility Analyst in Salesforce’s Office…

Published
Categorized as Technology

SRE Weekly Issue #314

A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly 🚒. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and adding responders, postmortem timeline, setting up reminders, and more. Book a demo (+ get a snazzy Rootly lego set): https://rootly.com/demo/ Articles Slight Reliability Episode 1…

Published
Categorized as SRE

Detecting silent errors in the wild: Combining two novel approaches to quickly detect silent data corruptions at scale

Silent data corruptions (SDCs), data errors that go undetected by the larger system, are a widespread problem for large-scale infrastructure systems. Left undetected, these types of corruptions can cause data loss and propagate across the stack and manifest as application-level problems. Silent data corruptions (SDC) in hardware impact computational integrity for large-scale applications. Sources of…

Published
Categorized as Technology

VESPA: Static profiling for binary optimization

What the research is: Recent research has demonstrated that binary optimization is important for achieving peak performance for various applications. For instance, the state-of-the-art BOLT binary optimizer developed at Meta, which is part of the LLVM Compiler Project, significantly improves the performance of highly optimized binaries produced using compilers’ most aggressive optimizations, such as profile-guided…

Published
Categorized as Technology