{"id":177,"date":"2021-01-21T20:01:00","date_gmt":"2021-01-21T20:01:00","guid":{"rendered":"https:\/\/fde.cat\/index.php\/2021\/01\/21\/budget-split-testing-a-trustworthy-and-powerful-approach-to-marketplace-a-b-testing\/"},"modified":"2021-02-02T13:46:58","modified_gmt":"2021-02-02T13:46:58","slug":"budget-split-testing-a-trustworthy-and-powerful-approach-to-marketplace-a-b-testing","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2021\/01\/21\/budget-split-testing-a-trustworthy-and-powerful-approach-to-marketplace-a-b-testing\/","title":{"rendered":"Budget-split testing: A trustworthy and powerful approach to marketplace A\/B testing"},"content":{"rendered":"<div class=\"resourceParagraph section\">\n<div class=\"component-anchor-container\">\n  <a class=\"component-anchor\" name=\"post_par_resourceparagraph\"><\/a>\n <\/div>\n<div class=\"resource-text-section\">\n<div class=\"resource-paragraph rich-text\">\n<p><i>Co-authors: <a href=\"https:\/\/www.linkedin.com\/in\/min-l-15696b43\" target=\"_blank\" rel=\"noopener\">Min Liu<\/a>, <a href=\"https:\/\/www.linkedin.com\/in\/vangelis-dimopoulos-70550925\/\" target=\"_blank\" rel=\"noopener\">Vangelis Dimopoulos<\/a>, <a href=\"https:\/\/www.linkedin.com\/in\/georis\/\" target=\"_blank\" rel=\"noopener\">Elise Georis<\/a>, <a href=\"https:\/\/www.linkedin.com\/in\/jialiang-mao-9125b7a6\/\" target=\"_blank\" rel=\"noopener\">Jialiang Mao<\/a>, <a href=\"https:\/\/www.linkedin.com\/in\/dibugger\/\" target=\"_blank\" rel=\"noopener\">Di Luo<\/a>, and <a href=\"https:\/\/www.linkedin.com\/in\/kang-kang-00673711\/\" target=\"_blank\" rel=\"noopener\">Kang Kang<\/a><\/i><\/p>\n<p>The LinkedIn ecosystem drives member and customer value through a series of marketplaces (e.g., the <a href=\"https:\/\/business.linkedin.com\/marketing-solutions\/cx\/17\/06\/advertise-on-linkedin?trk=sem_lms_gaw&amp;src=go-pa&amp;veh=LMS_NAMER_Core_USCA_Search_Google-Brand_DR-PRS_Broad_HeadTerms-Beta_All_English_Core_431812982384__%2Blinkedin%20%2Bads_c__kwd-28102859495_6458957180&amp;mcid=6612464045041733652&amp;cname=LMS_NAMER_Core_USCA_Search_Google-Brand_DR-PRS_Broad_HeadTerms-Beta_All_English_Core&amp;camid=6458957180&amp;asid=77594820416&amp;targetid=kwd-28102859495&amp;crid=431812982384&amp;placement=&amp;dev=c&amp;ends=1&amp;gclid=CjwKCAiAl4WABhAJEiwATUnEF9ob8yKM38IzQUlBVSTH7b5GrWg7668QYaXhCLj8vUPlthGLRLC09BoCJR0QAvD_BwE&amp;gclsrc=aw.ds\" target=\"_blank\" rel=\"noopener\">ads marketplace<\/a>, the <a href=\"https:\/\/business.linkedin.com\/talent-solutions\" target=\"_blank\" rel=\"noopener\">talent marketplace<\/a>, etc.). We maximize that value by making data-informed product decisions via <a href=\"https:\/\/engineering.linkedin.com\/blog\/2020\/a-b-testing-variant-assignment\" target=\"_blank\" rel=\"noopener\">A\/B testing<\/a>. Traditional A\/B tests on our marketplaces, however, are often statistically biased and under-powered. To mitigate this, we developed \u201cbudget-split\u201d testing, which provides more trustworthy and powerful marketplace A\/B testing. Read on to learn about the problem, solution, and successful results, using the <a href=\"https:\/\/business.linkedin.com\/marketing-solutions\/cx\/17\/06\/advertise-on-linkedin\" target=\"_blank\" rel=\"noopener\">ads marketplace<\/a> as a running example. For more technical details, please refer to the paper \u201c<a href=\"https:\/\/arxiv.org\/abs\/2012.08724\" target=\"_blank\" rel=\"noopener\">Trustworthy Online Marketplace Experimentation with Budget-split Design<\/a>.\u201d<\/p>\n<h2>Problems with marketplace A\/B testing<\/h2>\n<p>To add some important context, modern online ad marketplaces use <a href=\"http:\/\/www.eecs.tufts.edu\/~dsculley\/papers\/ad-click-prediction.pdf\" target=\"_blank\" rel=\"noopener\">auction-based models<\/a> for ad assignment. Advertisers set an objective, an audience, a campaign budget, and a bidding strategy to each ad campaign. Each \u201cresult\u201d (member click, view, etc., depending on the objective) utilizes a portion of the overall campaign budget, for a set duration, until the campaign ends or there is no more budget available. The maximum revenue generated by a campaign cannot exceed its set budget.<\/p>\n<p>When running A\/B tests on the ads marketplace, we noticed two types of problems:<\/p>\n<ol>\n<li>\n<p>When testing a new ad feature, we\u2019d often see a strong metric impact in our experiment, but wouldn\u2019t observe the same level of impact when launched to the entire marketplace.<\/p>\n<\/li>\n<li>\n<p>Many tests required an unacceptably long time to achieve statistically significant results.\u00a0<\/p>\n<\/li>\n<\/ol>\n<p>The first problem exemplified cannibalization bias, while the second stemmed from insufficient statistical power.<\/p>\n<p><b>Cannibalization bias<br \/> <\/b>We can illustrate cannibalization bias with a hypothetical example (note: real world manifestations of this bias are less extreme forms of this hypothetical). Suppose that we want to test how a new ad feature (e.g., improving the match between ads and members) impacts ad impressions and revenue. Prior to our experiment, let\u2019s say all ad campaigns were spending 100% of their budgets (i.e., no new feature can increase ads revenue further). If we test our new feature in a traditional A\/B test and observe increases in the number of ad impressions, the test would also show a corresponding increase in revenue for the treatment group. Once we launch the feature to the entire marketplace, however, we won\u2019t see that same increase in revenue because (remember) all campaigns were already spending 100% of their budgets. So why did our A\/B test lead us to the wrong conclusion?<\/p>\n<\/p><\/div>\n<\/p><\/div>\n<\/div>\n<div class=\"resourceImageBlock section\">\n<div class=\"component-anchor-container\">\n  <a class=\"component-anchor\" name=\"post_par_resourceimageblock\"><\/a>\n <\/div>\n<ul class=\"resource-image-block single\">\n<li class=\"resource-image\"> <img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/content.linkedin.com\/content\/dam\/engineering\/site-assets\/images\/blog\/posts\/2021\/01\/budgetsplit2.png?resize=750%2C218&#038;ssl=1\" alt=\"illustration-of-treatment-and-control-groups-playing-tug-of-war-over-budget\" height=\"218\" width=\"750\"  data-recalc-dims=\"1\"> <\/li>\n<\/ul>\n<\/div>\n<div class=\"resourceParagraph section\">\n<div class=\"component-anchor-container\">\n  <a class=\"component-anchor\" name=\"post_par_resourceparagraph_1863129421\"><\/a>\n <\/div>\n<div class=\"resource-text-section\">\n<div class=\"resource-paragraph rich-text\">\n<p>This happens because the treatment and control groups compete for the same budget. In this example, budget shifts to treatment because it\u2019s performing better. So the revenue \u201cincrease\u201d that we observe in treatment simply reflects budget shifting between the groups, rather than a higher level of realized ads revenue.\u00a0<\/p>\n<p><b>Insufficient power<br \/> <\/b>Beyond cannibalization bias, marketplace A\/B tests that are randomized on small populations (e.g. advertisers) can suffer from <a href=\"https:\/\/www.statisticsdonewrong.com\/power.html\" target=\"_blank\" rel=\"noopener\">low statistical power<\/a>. As a result, <a href=\"https:\/\/www.kdd.org\/kdd2018\/accepted-papers\/view\/sqr-balancing-speed-quality-and-risk-in-online-experiments\" target=\"_blank\" rel=\"noopener\">testing velocity<\/a> is low, which creates a bottleneck for product development.<\/p>\n<h2>Solution: Budget-split testing<\/h2>\n<\/p><\/div>\n<\/p><\/div>\n<\/div>\n<div class=\"resourceImageBlock section\">\n<div class=\"component-anchor-container\">\n  <a class=\"component-anchor\" name=\"post_par_resourceimageblock_197399271\"><\/a>\n <\/div>\n<ul class=\"resource-image-block single\">\n<li class=\"resource-image\"> <img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/content.linkedin.com\/content\/dam\/engineering\/site-assets\/images\/blog\/posts\/2021\/01\/budgetsplit6.png?resize=750%2C203&#038;ssl=1\" alt=\"illustration-of-budget-equally-split-between-control-and-treatment-groups\" height=\"203\" width=\"750\"  data-recalc-dims=\"1\"> <\/li>\n<\/ul>\n<\/div>\n<div class=\"resourceParagraph section\">\n<div class=\"component-anchor-container\">\n  <a class=\"component-anchor\" name=\"post_par_resourceparagraph_810266665\"><\/a>\n <\/div>\n<div class=\"resource-text-section\">\n<div class=\"resource-paragraph rich-text\">\n<p>We designed budget-split testing to solve for cannibalization bias and insufficient power.\u00a0<\/p>\n<p>First, we randomly split members into two equal-sized groups, with one group assigned to the treatment and the other to control. Then, we split the budget of each ad campaign into two identical \u201csub-campaigns,\u201d with each sub-campaign getting half of the original campaign\u2019s budget. Finally, we assign one of these sub-campaigns to the treatment member group and the other to the control member group.<\/p>\n<p>The two sub-campaigns act independently on their assigned members, so they can\u2019t compete for budget between treatment and control members. This functionally creates two identical marketplaces, where one has its members and sub-campaigns completely exposed to the treatment, while the other is completely exposed to the control. Directly comparing these two marketplaces measures impact without cannibalization bias. Furthermore, these tests run with a large member population (versus a relatively smaller advertiser population), which improves experiment power.<\/p>\n<h2>Implementation<\/h2>\n<p>We built budget-split testing with the following principles:\u00a0<\/p>\n<ol>\n<li>\n<p>The system must handle common test changes (e.g., turning tests on\/off, re-randomization, etc.).<\/p>\n<\/li>\n<li>\n<p>The system must perfectly separate budget between the two sub-campaigns.\u00a0<\/p>\n<\/li>\n<li>\n<p>The results must be easy to understand and must incorporate our existing business metrics.<\/p>\n<\/li>\n<\/ol>\n<p>The ads delivery system contains two main parts. The first is an ad server that responds to ad requests and controls responses via a bidding\/pacing module. The second is a tracking\/billing service that tracks ad impressions, clicks, and costs (which are tallied at the campaign level).<\/p>\n<p>We enabled budget-split testing as follows:<\/p>\n<ol>\n<li>\n<p>In the request handler tier of the ad server, we randomized all requests from a member into either treatment or control, depending on the member ID.\u00a0<\/p>\n<\/li>\n<li>\n<p>In the bidding\/pacing module, we replaced the campaign-level controls with sub-campaign level counterparts.\u00a0<\/p>\n<\/li>\n<li>\n<p>In the tracking service, we started tracking ad impressions and clicks at the sub-campaign level.<\/p>\n<\/li>\n<\/ol><\/div>\n<\/p><\/div>\n<\/div>\n<div class=\"resourceImageBlock section\">\n<div class=\"component-anchor-container\">\n  <a class=\"component-anchor\" name=\"post_par_resourceimageblock_207984110\"><\/a>\n <\/div>\n<ul class=\"resource-image-block single\">\n<li class=\"resource-image\"> <img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/content.linkedin.com\/content\/dam\/engineering\/site-assets\/images\/blog\/posts\/2021\/01\/budgetsplit4.png?resize=750%2C558&#038;ssl=1\" alt=\"graphic-illustrating-ad-delivery-system-components\" height=\"558\" width=\"750\"  data-recalc-dims=\"1\"> <\/li>\n<\/ul>\n<\/div>\n<div class=\"resourceParagraph section\">\n<div class=\"component-anchor-container\">\n  <a class=\"component-anchor\" name=\"post_par_resourceparagraph_1997152932\"><\/a>\n <\/div>\n<div class=\"resource-text-section\">\n<div class=\"resource-paragraph rich-text\">\n<h2>Results<\/h2>\n<p><b>Mitigating bias<br \/> <\/b>We compared the results from a series of budget-split tests with the results from traditional A\/B tests that were set up in a nearly identical way and observed a 30-70% difference in measured impact between the two methodologies. Each budget-split test showed a reduction in true impact relative to the traditional A\/B test counterpart by member. This confirms our initial hypothesis that traditional A\/B tests are far less reliable than unbiased budget-split tests.\u00a0<\/p>\n<p><b>Improving power<br \/> <\/b>We also compared the power of budget-split tests with the power of both traditional A\/B tests (measured by campaigns) and <a href=\"https:\/\/arxiv.org\/abs\/2009.00148\" target=\"_blank\" rel=\"noopener\">\u201calternating-day\u201d tests<\/a> (a common marketplace testing workaround). Budget-split testing improved test sensitivity by up to 10X. Tests that used to require several weeks now only take 1-3 days.<\/p>\n<\/p><\/div>\n<\/p><\/div>\n<\/div>\n<div class=\"resourceImageBlock section\">\n<div class=\"component-anchor-container\">\n  <a class=\"component-anchor\" name=\"post_par_resourceimageblock_886932720\"><\/a>\n <\/div>\n<ul class=\"resource-image-block single\">\n<li class=\"resource-image\"> <img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/i0.wp.com\/content.linkedin.com\/content\/dam\/engineering\/site-assets\/images\/blog\/posts\/2021\/01\/budgetsplit5.png?resize=750%2C269&#038;ssl=1\" alt=\"graphs-showing-improved-performance-of-budget-split-testing-compared-to-other-testing-types\" height=\"269\" width=\"750\"  data-recalc-dims=\"1\"> <\/li>\n<\/ul>\n<\/div>\n<div class=\"resourceParagraph section\">\n<div class=\"component-anchor-container\">\n  <a class=\"component-anchor\" name=\"post_par_resourceparagraph_314929880\"><\/a>\n <\/div>\n<div class=\"resource-text-section\">\n<div class=\"resource-paragraph rich-text\">\n<h2>Conclusion<\/h2>\n<p>Budget-split has mitigated cannibalization bias and magnified statistical power in our marketplace testing. This has since unblocked product launches with double-digit impact on member value (e.g., more relevant ads in the feed, a more engaging job seeker experience, etc.) and customer value (e.g., better return on ad spend, higher ROI for <a href=\"https:\/\/business.linkedin.com\/talent-solutions\" target=\"_blank\" rel=\"noopener\">job posters<\/a>, etc.). We hope that readers can derive similar value by applying our learnings to their own marketplaces.<\/p>\n<h2>Acknowledgments<\/h2>\n<p>We would like to thank\u00a0<a href=\"https:\/\/www.linkedin.com\/in\/wei-wei-65055350\" target=\"_blank\" rel=\"noopener\">Wei Wei<\/a>\u00a0and\u00a0<a href=\"https:\/\/www.linkedin.com\/in\/ishangupta91\" target=\"_blank\" rel=\"noopener\">Ishan Gupta<\/a>\u00a0for implementation of budget-split test in Ads Marketplace, and\u00a0<a href=\"https:\/\/www.linkedin.com\/in\/qing-duan-89114447\" target=\"_blank\" rel=\"noopener\">Qing Duan<\/a>,\u00a0<a href=\"https:\/\/nam06.safelinks.protection.outlook.com\/?url=https%3A%2F%2Fwww.linkedin.com%2Fin%2Fjianqiangshen&amp;data=04%7C01%7Cmliu%40linkedin.com%7Cd032a9d4d33d4546621808d8c188f310%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637472141217243108%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=AE53vHUVD9IWqaLaZpdjwbt0dg1TQKh3n0hwtE1nor0%3D&amp;reserved=0\" target=\"_blank\" rel=\"noopener\">Jerry Shen<\/a>,\u00a0<a href=\"https:\/\/nam06.safelinks.protection.outlook.com\/?url=https%3A%2F%2Fwww.linkedin.com%2Fin%2Flindafayad&amp;data=04%7C01%7Cmliu%40linkedin.com%7Cd032a9d4d33d4546621808d8c188f310%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637472141217253069%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=HY7kRtosa0gv8k%2F9Dxr%2BwvNRNfYPubpGLPGEi9i%2B6qw%3D&amp;reserved=0\" target=\"_blank\" rel=\"noopener\">Linda Fayad<\/a>,\u00a0<a href=\"https:\/\/nam06.safelinks.protection.outlook.com\/?url=https%3A%2F%2Fwww.linkedin.com%2Fin%2Fgiorgiomartini0&amp;data=04%7C01%7Cmliu%40linkedin.com%7Cd032a9d4d33d4546621808d8c188f310%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637472141217253069%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=2gkLma%2F6AsCOLl0zztBYGTkZQAyGda0C7tr2nUQP7lE%3D&amp;reserved=0\" target=\"_blank\" rel=\"noopener\">Giorgio Martini<\/a>,\u00a0and other team members\u00a0from the LinkedIn Jobs Marketplace AI team for the design and implementation of budget-split testing for Jobs Marketplace.\u00a0<a href=\"https:\/\/www.linkedin.com\/in\/ya-xu\" target=\"_blank\" rel=\"noopener\">Ya Xu<\/a>,\u00a0<a href=\"https:\/\/www.linkedin.com\/in\/weitaoduan\/\" target=\"_blank\" rel=\"noopener\">Weitao Duan<\/a>,\u00a0<a href=\"https:\/\/www.linkedin.com\/in\/parvezahammad\" target=\"_blank\" rel=\"noopener\">Parvez Ahammad<\/a>,\u00a0<a href=\"https:\/\/www.linkedin.com\/in\/anujkmalhotra\" target=\"_blank\" rel=\"noopener\">Anuj Malhotra<\/a>,\u00a0<a href=\"https:\/\/www.linkedin.com\/in\/leli\" target=\"_blank\" rel=\"noopener\">Le Li<\/a>,\u00a0<a href=\"https:\/\/www.linkedin.com\/in\/onkar-dalal\" target=\"_blank\" rel=\"noopener\">Onkar Dalal<\/a>,\u00a0<a href=\"https:\/\/www.linkedin.com\/in\/shahriarshariat\" target=\"_blank\" rel=\"noopener\">Shahriar Shariat<\/a>,\u00a0<a href=\"https:\/\/www.linkedin.com\/in\/yi-zhang-7364ba2b\" target=\"_blank\" rel=\"noopener\">Yi Zhang<\/a>,\u00a0<a href=\"https:\/\/www.linkedin.com\/in\/kaiyu-yang-49b72435\" target=\"_blank\" rel=\"noopener\">Kaiyu Yang<\/a>,\u00a0<a href=\"https:\/\/www.linkedin.com\/in\/xingyaoye\" target=\"_blank\" rel=\"noopener\">Xingyao Ye<\/a>,\u00a0<a href=\"https:\/\/www.linkedin.com\/in\/yang-zhao-33a7676a\" target=\"_blank\" rel=\"noopener\">Yang Zhao<\/a>,\u00a0<a href=\"https:\/\/www.linkedin.com\/in\/jianqiangshen\" target=\"_blank\" rel=\"noopener\">Jerry Shen<\/a>,\u00a0<a href=\"https:\/\/www.linkedin.com\/in\/stevena1\" target=\"_blank\" rel=\"noopener\">Steve Na<\/a>,\u00a0<a href=\"https:\/\/www.linkedin.com\/in\/maheshgupta\/\" target=\"_blank\" rel=\"noopener\">Mahesh Gupta<\/a>,\u00a0<a href=\"https:\/\/www.linkedin.com\/in\/drlebedev\/\" target=\"_blank\" rel=\"noopener\">Kirill Lebedev<\/a>,\u00a0<a href=\"https:\/\/www.linkedin.com\/in\/mindaou\" target=\"_blank\" rel=\"noopener\">Mindaou Gu<\/a>, and\u00a0<a href=\"https:\/\/www.linkedin.com\/in\/sumedhaswamy\" target=\"_blank\" rel=\"noopener\">Sumedha Swamy<\/a>\u00a0for the continued support and helpful discussions.\u00a0Stephen Lynch, Heyun Jeong, and Hannah Sills for reviews. Finally, thank you to <a href=\"https:\/\/www.linkedin.com\/in\/stacievu\" target=\"_blank\" rel=\"noopener\">Stacie Vu<\/a> for the graphics used in the first half of this post.<\/p>\n<\/p><\/div>\n<\/p><\/div>\n<\/div>\n<p><a href=\"https:\/\/engineering.linkedin.com\/blog\/2021\/budget-split-testing\" target=\"_blank\" rel=\"noopener\">Read More<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Co-authors: Min Liu, Vangelis Dimopoulos, Elise Georis, Jialiang Mao, Di Luo, and Kang Kang The LinkedIn ecosystem drives member and customer value through a series of marketplaces (e.g., the ads marketplace, the talent marketplace, etc.). We maximize that value by making data-informed product decisions via A\/B testing. Traditional A\/B tests on our marketplaces, however, are&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2021\/01\/21\/budget-split-testing-a-trustworthy-and-powerful-approach-to-marketplace-a-b-testing\/\">Continue reading <span class=\"screen-reader-text\">Budget-split testing: A trustworthy and powerful approach to marketplace A\/B testing<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[1,7],"tags":[],"class_list":["post-177","post","type-post","status-publish","format-standard","hentry","category-external","category-technology","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":555,"url":"https:\/\/fde.cat\/index.php\/2022\/03\/17\/detecting-silent-errors-in-the-wild-combining-two-novel-approaches-to-quickly-detect-silent-data-corruptions-at-scale\/","url_meta":{"origin":177,"position":0},"title":"Detecting silent errors in the wild: Combining two novel approaches to quickly detect silent data corruptions at scale","date":"March 17, 2022","format":false,"excerpt":"Silent data corruptions (SDCs), data errors that go undetected by the larger system, are a widespread problem for large-scale infrastructure systems. Left undetected, these types of corruptions can cause data loss and propagate across the stack and manifest as application-level problems. Silent data corruptions (SDC) in hardware impact computational integrity\u2026","rel":"","context":"In &quot;Technology&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":493,"url":"https:\/\/fde.cat\/index.php\/2021\/10\/20\/autonomous-testing-of-services-at-scale\/","url_meta":{"origin":177,"position":1},"title":"Autonomous testing of services at scale","date":"October 20, 2021","format":false,"excerpt":"Enabling developers to prototype, test, and iterate on new features quickly is important to Facebook\u2019s success. To do this effectively, it\u2019s key to have a stable infrastructure that doesn\u2019t introduce unnecessary friction. This gets significantly more challenging when the infrastructure in question must also scale to support more than 3\u2026","rel":"","context":"In &quot;Technology&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":528,"url":"https:\/\/fde.cat\/index.php\/2022\/01\/06\/managing-availability-in-service-based-deployments-with-continuous-testing\/","url_meta":{"origin":177,"position":2},"title":"Managing Availability in Service Based Deployments with Continuous Testing","date":"January 6, 2022","format":false,"excerpt":"The Problem At Salesforce, trust is our number one value. What this equates to is that our customers need to trust us; trust us to safeguard their data, trust that we will keep our services up and running, and trust that we will be there for them when they need\u00a0us.\u2026","rel":"","context":"In &quot;Technology&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":887,"url":"https:\/\/fde.cat\/index.php\/2024\/06\/25\/the-future-of-ai-testing-salesforces-next-gen-framework-for-ai-model-performance\/","url_meta":{"origin":177,"position":3},"title":"The Future of AI Testing: Salesforce\u2019s Next Gen Framework for AI Model Performance","date":"June 25, 2024","format":false,"excerpt":"In our \u201cEngineering Energizers\u201d Q&A series, we explore the innovative minds shaping the future of Salesforce engineering. Today, we meet Erwin Karbasi, who leads the development of the Salesforce Central Evaluation Framework (SF Eval), a revolutionary internal tool used by Salesforce engineers to assess the performance of generative AI models.\u2026","rel":"","context":"In &quot;Technology&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":860,"url":"https:\/\/fde.cat\/index.php\/2024\/04\/24\/inside-data-clouds-secret-formula-for-processing-one-quadrillion-records-monthly\/","url_meta":{"origin":177,"position":4},"title":"Inside Data Cloud\u2019s Secret Formula for Processing One Quadrillion Records Monthly","date":"April 24, 2024","format":false,"excerpt":"In our \u201cEngineering Energizers\u201d Q&A series, we explore the inspiring journeys of engineering leaders who have significantly advanced their fields. Today, we meet Soumya KV, who spearheads the development of the Data Cloud\u2019s internal apps layer at Salesforce. Her India-based team specializes in advanced data segmentation and activation, enabling tailored\u2026","rel":"","context":"In &quot;Technology&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":884,"url":"https:\/\/fde.cat\/index.php\/2024\/06\/21\/how-einstein-copilot-sharpens-large-language-model-outputs-and-redefines-ai-data-testing\/","url_meta":{"origin":177,"position":5},"title":"How Einstein Copilot Sharpens Large Language Model Outputs and Redefines AI Data Testing","date":"June 21, 2024","format":false,"excerpt":"In our \u201cEngineering Energizers\u201d Q&A series, we explore the paths of engineering leaders who have attained significant accomplishments in their respective fields. Today, we spotlight Armita Peymandoust, Senior Vice President of Software Engineering at Salesforce, who spearheads the development of Einstein Copilot, a conversational AI assistant for CRM that integrates\u2026","rel":"","context":"In &quot;Technology&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/177","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=177"}],"version-history":[{"count":1,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/177\/revisions"}],"predecessor-version":[{"id":205,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/177\/revisions\/205"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=177"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=177"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=177"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}