Asicmon: A platform agnostic observability system for AI accelerators

We will be hosting a talk about our work on, “A Platform Agnostic Observability System for AI Accelerators” during our virtual Systems @Scale event at 10:20 a.m. PT on Wednesday, June 30, followed by a live Q&A session. Please submit any questions to systemsatscale@fb.com before the event. Accelerators are special-purpose hardware devices optimized for specific… Continue reading Asicmon: A platform agnostic observability system for AI accelerators

Published
Categorized as Technology

Driving towards an open internet ecosystem to help tackle the digital divide

Connectivity is an integral part of Facebook’s mission to bring people closer together, and the COVID-19 pandemic has only heightened the demand for critical internet access. According to the latest edition of our Inclusive Internet Index, nearly 70 percent of people around the world believe that increased internet usage in all aspects of their lives… Continue reading Driving towards an open internet ecosystem to help tackle the digital divide

Published
Categorized as Technology

SRE Weekly Issue #276

View on sreweekly.com A message from our sponsor, StackHawk: Get ready for some GraphQL! Tune in this Tuesday, June 29 at 9 AM PT for an automated GraphQL security testing learning lab. Register: http://sthwk.com/graphql-learning-lab Articles @GergelyOrosz on blaming the intern HBO accidentally sent an email to a bunch of people, and they tweeted (jokingly?) blaming… Continue reading SRE Weekly Issue #276

Published
Categorized as SRE

117. Open Source with Jim Jagielski

ListenSubscribe Jim Jagielski is the newest member of Salesforce’s Open Source Program Office, but he’s no newbie to open source. In this episode, he talks with Alyssa Arvin, Senior Program Manager for Open Source about his early explorations into open source software during his time as an actual rocket scientist at NASA, what he’s learned… Continue reading 117. Open Source with Jim Jagielski

Published
Categorized as Technology

Consolidating Facebook storage infrastructure with Tectonic file system

What the research is:  Tectonic, our data center scale distributed file system, enables better resource utilization, promotes simpler services, and requires less operational complexity than our previous approach. Our previous storage infrastructure consisted of a set of use-case specific storage systems. Clusters, or instances of these storage systems, used to scale to tens of petabytes.… Continue reading Consolidating Facebook storage infrastructure with Tectonic file system

Published
Categorized as Technology

Meet Kats — a one-stop shop for time series analysis

What it is:  A new library to analyze time series data. Kats is a lightweight, easy-to-use, and generalizable framework for generic time series analysis, including forecasting, anomaly detection, multivariate analysis, and feature extraction/embedding. To the best of our knowledge, Kats is the first comprehensive Python library for generic time series analysis, which provides both classical… Continue reading Meet Kats — a one-stop shop for time series analysis

Published
Categorized as Technology

SRE Weekly Issue #275

View on sreweekly.com A message from our sponsor, StackHawk: Join ZAP Founder & Project Lead Simon Bennetts on June 30 for a live AMA where he will be answering questions on all things open source and AppSec. Register: http://sthwk.com/Simon-AMA Articles Practical Guide to SRE: Incident Severity Levels Here’s a take on incident severity levels. I… Continue reading SRE Weekly Issue #275

Published
Categorized as SRE

Network hose: Managing uncertain network demand with model simplicity

Our production backbone network connects our data centers and delivers content to our users. The network supports a vast number of different services, distributed across a multitude of data centers. Traffic patterns shift over time from one data center to another due to the introduction of new services, service architecture changes, changes in user behavior,… Continue reading Network hose: Managing uncertain network demand with model simplicity

Published
Categorized as Technology

SRE Weekly Issue #274

View on sreweekly.com A message from our sponsor, StackHawk: Join the GraphQL Security Testing Learning Lab on June 29 at 9 AM PT. Learn how to run automated security testing against your GraphQL APIs so you can find and fix vulnerabilities fast. http://sthwk.com/graphql-learning-lab Articles Chicken Soup for the SLO The last section suggests selling SLOs… Continue reading SRE Weekly Issue #274

Published
Categorized as SRE

A Deep Dive on Text Classification at Salesforce

published on Towards Data Science Putting from a Sand Trap (Image by Author) We’re excited to announce that Noah Burbank, a Principal Data Scientist in Sales Cloud, has recently published a deep dive into text classification at Salesforce on Towards Data Science. The article, How to choose the right model for text classification in an organizational setting,… Continue reading A Deep Dive on Text Classification at Salesforce

Published
Categorized as Technology