SRE Weekly Issue #272

View on sreweekly.com A message from our sponsor, StackHawk: See how automated security testing can change how your teams find and fix security vulnerabilities. http://sthwk.com/security-automation Articles [Salesforce] Multi-Instance Service Disruption on May 11-12, 2021 Salesforce has posted a ton of information about their major outage two weeks ago. It involved a change to their DNS… Continue reading SRE Weekly Issue #272

Published
Categorized as SRE

How Facebook deals with PCIe faults to keep our data centers running reliably

Peripheral component interconnect express (PCIe) hardware continues to push the boundaries of computing thanks to advances in transfer speeds, the number of available lanes for simultaneous data delivery, and a comparatively small footprint on motherboards. Today, PCIe connectivity-based hardware delivers faster data transfers and is one of the de facto methods to connect components to… Continue reading How Facebook deals with PCIe faults to keep our data centers running reliably

Published
Categorized as Technology

SRE Weekly Issue #273

View on sreweekly.com A message from our sponsor, StackHawk: StackHawk is helping One Medical equip developers with automated security testing and self-service remediations. See how: http://sthwk.com/onemedical Articles Incident Management vs. Incident Response What indeed? It depends on who you ask. Quentin Rousseau — Rootly Cores that don’t count This academic paper explains Google’s efforts toward… Continue reading SRE Weekly Issue #273

Published
Categorized as SRE

API Federation: growing scalable API landscapes

Organizations embrace micro-services and event-driven APIs in their technology platforms to try to achieve the promise of greater agility, increased innovation, and more autonomy for their development teams. However, after the initial success, it is not unusual for organizations to face difficulties when they try to scale their distributed platforms. At this point, with the… Continue reading API Federation: growing scalable API landscapes

Published
Categorized as Technology

116. Success From Anywhere

ListenSubscribe COVID-19 has created massive changes to the way we work, not only bringing the remote work experience to the masses but creating an opportunity to redesign offices to suit flex workers. In this week’s podcast episode, Greg Nokes welcomes Lisa Marshall, Senior Vice President of Technology, People, Innovation & Learning at Salesforce. Marshall shares… Continue reading 116. Success From Anywhere

Published
Categorized as Technology

A Deep Dive on Text Classification at Salesforce

published on Towards Data Science Putting from a Sand Trap (Image by Author) We’re excited to announce that Noah Burbank, a Principal Data Scientist in Sales Cloud, has recently published a deep dive into text classification at Salesforce on Towards Data Science. The article, How to choose the right model for text classification in an organizational setting,… Continue reading A Deep Dive on Text Classification at Salesforce

Published
Categorized as Technology

SRE Weekly Issue #274

View on sreweekly.com A message from our sponsor, StackHawk: Join the GraphQL Security Testing Learning Lab on June 29 at 9 AM PT. Learn how to run automated security testing against your GraphQL APIs so you can find and fix vulnerabilities fast. http://sthwk.com/graphql-learning-lab Articles Chicken Soup for the SLO The last section suggests selling SLOs… Continue reading SRE Weekly Issue #274

Published
Categorized as SRE

Network hose: Managing uncertain network demand with model simplicity

Our production backbone network connects our data centers and delivers content to our users. The network supports a vast number of different services, distributed across a multitude of data centers. Traffic patterns shift over time from one data center to another due to the introduction of new services, service architecture changes, changes in user behavior,… Continue reading Network hose: Managing uncertain network demand with model simplicity

Published
Categorized as Technology

SRE Weekly Issue #275

View on sreweekly.com A message from our sponsor, StackHawk: Join ZAP Founder & Project Lead Simon Bennetts on June 30 for a live AMA where he will be answering questions on all things open source and AppSec. Register: http://sthwk.com/Simon-AMA Articles Practical Guide to SRE: Incident Severity Levels Here’s a take on incident severity levels. I… Continue reading SRE Weekly Issue #275

Published
Categorized as SRE

Meet Kats — a one-stop shop for time series analysis

What it is:  A new library to analyze time series data. Kats is a lightweight, easy-to-use, and generalizable framework for generic time series analysis, including forecasting, anomaly detection, multivariate analysis, and feature extraction/embedding. To the best of our knowledge, Kats is the first comprehensive Python library for generic time series analysis, which provides both classical… Continue reading Meet Kats — a one-stop shop for time series analysis

Published
Categorized as Technology