Co-authors: Zihan Li, Sudarshan Vasudevan, Lei Sun, and Shirshanka Das. Data analytics and AI power many business-critical use cases at LinkedIn. We need to ingest data in a timely and reliable way from a variety of sources, including Kafka, Oracle, and Espresso, bringing it into our Hadoop data lake for subsequent processing by AI and… Continue reading FastIngest: Low-latency Gobblin with Apache Iceberg and ORC format
Category: External
Post feed from an external site
2020 year in review: Connectivity innovations, faster apps, and progress toward net zero
It goes without saying that 2020 has been a challenging year, to put it lightly. But if anything, the COVID-19 pandemic has shined a light on our need to connect as people. For Facebook, that meant our work has become more important than ever. Whether it was finding new and innovative ways to expand internet… Continue reading 2020 year in review: Connectivity innovations, faster apps, and progress toward net zero
A smaller, faster video calling library for our apps
We are rolling out a new video calling library to all the relevant products across our apps and services, including Instagram, Messenger, Portal, and Workplace chat. To create a library generic enough to support all these different use cases, we needed to rewrite our existing library from scratch using the latest version of the open… Continue reading A smaller, faster video calling library for our apps
Evenstar: The road to an alternative infra supply chain for RAN
Together with mobile network operators Airtel and Vodafone, and technology partners ADI, Altiostar, Baicells, Flex, Mavenir, MTI, Parallel Wireless, and Xilinx, we are focused on building general-purpose radio access network (RAN) reference designs for 4G and 5G networks in the Open RAN ecosystem. This work is being done as part of the Evenstar program. The… Continue reading Evenstar: The road to an alternative infra supply chain for RAN
A/B testing at LinkedIn: Assigning variants at scale
Co-authors: Alexander Ivaniuk and Weitao Duan. Editor’s note: This blog post is the second in a series providing an overview and history of LinkedIn’s experimentation platform. The previous post on the history of LinkedIn’s experimentation infrastructure can be found here. Introducing variant assignment: Previously on the blog, we’ve shared a look into how experimentation works… Continue reading A/B testing at LinkedIn: Assigning variants at scale
Co-experiences: Hanging out in the digital living room
In the spring of 2019 — a full year before the COVID-19 pandemic caused the entire world to turn our living rooms into offices, schools, and gathering places — Mark Zuckerberg shared a plan to create the digital equivalent of the living room, where people could connect and hang out together. That work was already underway early… Continue reading Co-experiences: Hanging out in the digital living room
Infer powering Microsoft’s Infer#, a new static analyzer for C#
What it is: Infer# brings the Infer static analysis platform to developers who use Microsoft’s C# programming language. It can already detect null-pointer dereference and resource leak bugs, thanks to bi-abduction analysis. Detection of race conditions based on RacerD analysis is also in the works. Infer# has been used to analyze Microsoft software, including Roslyn,… Continue reading Infer powering Microsoft’s Infer#, a new static analyzer for C#
Coral: A SQL translation, analysis, and rewrite engine for modern data lakehouses
Co-authors: Walaa Eldin Moustafa, Wenye Zhang, Sushant Raikar, Raymond Lam, Ron Hu, Shardul Mahadik, Laura Chen, Khai Tran, Chris Chen, and Nagarathnam Muthusamy. Introduction: At LinkedIn, our big data compute infrastructure continually grows over time, not only to keep pace with the growth in the number of data applications, or their domains spanning data curation,… Continue reading Coral: A SQL translation, analysis, and rewrite engine for modern data lakehouses
How do you test your tests?
Facebook’s codebase changes each day as engineers develop new features and optimizations for our apps. If not validated, each of these changes could potentially regress the functionality or reliability of our products for billions of people around the world. To mitigate this risk, we maintain an enormous suite of automated regression tests to cover various… Continue reading How do you test your tests?
How Facebook keeps its large-scale infrastructure hardware up and running
Facebook’s services rely on fleets of servers in data centers all over the globe — all running applications and delivering the performance our services need. This is why we need to make sure our server hardware is reliable and that we can manage server hardware failures at our scale with as little disruption to our… Continue reading How Facebook keeps its large-scale infrastructure hardware up and running