Running Border Gateway Protocol in large-scale data centers

What the research is: A first-of-its-kind study that details the scalable design, software implementation, and operations of Facebook’s data center routing design, based on Border Gateway Protocol (BGP). BGP was originally designed to interconnect autonomous internet service providers (ISPs) on the global internet. Highly scalable and widely acknowledged as an attractive choice for routing, BGP… Continue reading Running Border Gateway Protocol in large-scale data centers

Published
Categorized as Technology

SRE Weekly Issue #270

View on sreweekly.com A message from our sponsor, StackHawk: APIs are not only the backbone of modern application architecture, but they are also a key part of security. Discover what API security testing is, how it works, and get started using API security tools http://sthwk.com/API-security Articles Thundering herds, noisy neighbours, and retry storms This is… Continue reading SRE Weekly Issue #270

Published
Categorized as SRE

Designing Accessible Builder Apps

Global Accessibility Awareness Day (GAAD) highlights the importance of Digital Access and Inclusion for over 1 Billion People with Disabilities around the world. We enthusiastically celebrate GAAD at Salesforce because it directly speaks to our role in creating a more inclusive and just world. The World Health Organization defines Disability as “…a mismatched interaction between… Continue reading Designing Accessible Builder Apps

Published
Categorized as Technology

Peering automation at Facebook

Traffic on the internet travels across many different kinds of links. A fast and reliable way to exchange traffic between different networks and service providers is through peering. Initially, we managed peering via a time-intensive manual process. Reliable peering is essential for Facebook and for everyone’s internet use. But there is no industry standard for… Continue reading Peering automation at Facebook

Published
Categorized as Technology

SRE Weekly Issue #271

View on sreweekly.com A message from our sponsor, StackHawk: Join StackHawk on Tuesday, May 25 for a hands-on authenticated security testing workshop. Follow along as we walk through three common authentication scenarios step-by-step. Register: http://sthwk.com/auth-workshop Articles Naming names in incident writeups Should you keep things anonymous (“an engineer”), or should you say exactly who did… Continue reading SRE Weekly Issue #271

Published
Categorized as SRE

SRE Weekly Issue #272

View on sreweekly.com A message from our sponsor, StackHawk: See how automated security testing can change how your teams find and fix security vulnerabilities. http://sthwk.com/security-automation Articles [Salesforce] Multi-Instance Service Disruption on May 11-12, 2021 Salesforce has posted a ton of information about their major outage two weeks ago. It involved a change to their DNS… Continue reading SRE Weekly Issue #272

Published
Categorized as SRE

How Facebook deals with PCIe faults to keep our data centers running reliably

Peripheral component interconnect express (PCIe) hardware continues to push the boundaries of computing thanks to advances in transfer speeds, the number of available lanes for simultaneous data delivery, and a comparatively small footprint on motherboards. Today, PCIe connectivity-based hardware delivers faster data transfers and is one of the de facto methods to connect components to… Continue reading How Facebook deals with PCIe faults to keep our data centers running reliably

Published
Categorized as Technology

SRE Weekly Issue #273

View on sreweekly.com A message from our sponsor, StackHawk: StackHawk is helping One Medical equip developers with automated security testing and self-service remediations. See how: http://sthwk.com/onemedical Articles Incident Management vs. Incident Response What indeed? It depends on who you ask. Quentin Rousseau — Rootly Cores that don’t count This academic paper explains Google’s efforts toward… Continue reading SRE Weekly Issue #273

Published
Categorized as SRE

API Federation: growing scalable API landscapes

Organizations embrace micro-services and event-driven APIs in their technology platforms to try to achieve the promise of greater agility, increased innovation, and more autonomy for their development teams. However, after the initial success, it is not unusual for organizations to face difficulties when they try to scale their distributed platforms. At this point, with the… Continue reading API Federation: growing scalable API landscapes

Published
Categorized as Technology

116. Success From Anywhere

ListenSubscribe COVID-19 has created massive changes to the way we work, not only bringing the remote work experience to the masses but creating an opportunity to redesign offices to suit flex workers. In this week’s podcast episode, Greg Nokes welcomes Lisa Marshall, Senior Vice President of Technology, People, Innovation & Learning at Salesforce. Marshall shares… Continue reading 116. Success From Anywhere

Published
Categorized as Technology