SRE Weekly Issue #432

A message from our sponsor, FireHydrant: We’ve gone all out on our new integration with Microsoft Teams. If you’re a MS Teams user, FireHydrant now supports the most comprehensive integration for incident management. Run the entire IM process without ever leaving the chat. https://firehydrant.com/blog/introducing-a-brand-new-microsoft-teams-integration/

Investigating Mysterious Kafka Broker I/O When Using Confluent…

Categorized as SRE

Hyperforce’s Template for Enhancing Developer Workflow: Inside the 7 Pillars of Agile Development

Written by Armin Bahramshahry and Shan Appajodu. Hyperforce is a pivotal infrastructure platform for Salesforce, enhancing global service delivery through top public cloud platforms for increased safety, scalability, and agility. Hyperforce enabled the rollout of new innovations like Data Cloud and boosted the global scalability of Salesforce’s Core CRM. To help align developer agility with infrastructure…

SRE Weekly Issue #431

Cloudflare incident on June 20, 2024 This is…

Categorized as SRE

The Future of AI Testing: Salesforce’s Next Gen Framework for AI Model Performance

In our “Engineering Energizers” Q&A series, we explore the innovative minds shaping the future of Salesforce engineering. Today, we meet Erwin Karbasi, who leads the development of the Salesforce Central Evaluation Framework (SF Eval), a revolutionary internal tool used by Salesforce engineers to assess the performance of generative AI models. Explore how SF Eval addresses…

Categorized as Technology

Leveraging AI for efficient incident response

We’re sharing how we streamline system reliability investigations using a new AI-assisted root cause analysis system. The system uses a combination of heuristic-based retrieval and large language model-based ranking to speed up root cause identification during investigations. Our testing has shown this new system achieves 42% accuracy in identifying root causes for investigations at their…

Categorized as Technology

SRE Weekly Issue #430

r/sre: Senior SRE looking for a resume review,…

Categorized as SRE

How Einstein Copilot Sharpens Large Language Model Outputs and Redefines AI Data Testing

In our “Engineering Energizers” Q&A series, we explore the paths of engineering leaders who have attained significant accomplishments in their respective fields. Today, we spotlight Armita Peymandoust, Senior Vice President of Software Engineering at Salesforce, who spearheads the development of Einstein Copilot, a conversational AI assistant for CRM that integrates data, metadata, prompts, and workflows…

Categorized as Technology

PVF: A novel metric for understanding AI systems’ vulnerability against SDCs in model parameters

We’re introducing parameter vulnerability factor (PVF), a novel metric for understanding and measuring AI systems’ vulnerability against silent data corruptions (SDCs) in model parameters. PVF can be tailored to different AI models and tasks, adapted to different hardware faults, and even extended to the training phase of AI models. We’re sharing results of our own…

Categorized as Technology

SRE Weekly Issue #429

Virtualizing Our Storage Engine Time to get down…

Categorized as SRE