The post Introducing the next-gen Meta Training and Inference Accelerator appeared first on Engineering at Meta.
Month: April 2024
Enhancing AIOps Efficiency: Salesforce’s New Similarity Model Overcomes 4 Major Incident Management Challenges
Optimizing the management of alerts from monitoring tools is crucial for efficient operations. However, it can be challenging because there is often no confirmation of whether subsequent alerts indicate the same underlying problem. This leads to a repetitive and time-consuming process for an organization’s operations team, including site reliability engineers, performance engineers and others…
SRE Weekly Issue #419
A message from our sponsor, FireHydrant: FireHydrant is now AI-powered for faster, smarter incidents! Power up your incidents with auto-generated real-time summaries, retrospectives, and status page updates. https://firehydrant.com/blog/ai-for-incident-management-is-here/ How Figma’s Databases Team Lived to Tell the Scale: Our nine-month journey to horizontally shard Figma’s Postgres stack, and the key to unlocking…
Unveiling the Cutting-Edge Features of ML Console for AI Model Lifecycle Management
In our “Engineering Energizers” Q&A series, we explore the journeys of engineering leaders who have made remarkable contributions in their fields. Today, we meet Venkat Krishnamani, a Lead Member of the Technical Staff for Salesforce Engineering and the lead engineer for Salesforce Einstein’s Machine Learning (ML) Console. This vital tool for internal AI and ML…
SRE Weekly Issue #418
Redefining Observability: The observability waters have been muddy for a while, and this article does a great job of taking a step back and building…