{"id":697,"date":"2023-04-06T16:00:52","date_gmt":"2023-04-06T16:00:52","guid":{"rendered":"https:\/\/fde.cat\/index.php\/2023\/04\/06\/build-faster-with-buck2-our-open-source-build-system\/"},"modified":"2023-04-06T16:00:52","modified_gmt":"2023-04-06T16:00:52","slug":"build-faster-with-buck2-our-open-source-build-system","status":"publish","type":"post","link":"https:\/\/fde.cat\/index.php\/2023\/04\/06\/build-faster-with-buck2-our-open-source-build-system\/","title":{"rendered":"Build faster with Buck2: Our open source build system"},"content":{"rendered":"<p><span>Buck2, our <\/span><a href=\"https:\/\/github.com\/facebook\/buck2\" target=\"_blank\" rel=\"noopener\"><span>new open source, large-scale build system<\/span><\/a><span>, is now available on GitHub.<\/span><br \/>\n<span>Buck2 is an extensible and performant build system written in Rust and designed to make your build experience faster and more efficient.\u00a0<\/span><br \/>\n<span>In our internal tests at Meta, we observed that Buck2 completed builds 2x as fast as Buck1.<\/span><\/p>\n<p><span>Buck2, Meta\u2019s open source large-scale build system, is now publicly available via the<\/span><a href=\"https:\/\/buck2.build\/\" target=\"_blank\" rel=\"noopener\"> <span>Buck2 website<\/span><\/a><span> and<\/span><a href=\"https:\/\/github.com\/facebook\/buck2\" target=\"_blank\" rel=\"noopener\"> the Buck2<span> GitHub repository<\/span><\/a><span>. While it shares some commonalities with other build systems (like <a href=\"https:\/\/engineering.fb.com\/2017\/11\/09\/android\/rethinking-android-app-compilation-with-buck\/\" target=\"_blank\" rel=\"noopener\">Buck1<\/a> and <a href=\"https:\/\/engineering.fb.com\/2017\/06\/05\/web\/dev-tools-scale-2017-recap\/\" target=\"_blank\" rel=\"noopener\">Bazel<\/a>), Buck2 is a from-scratch rewrite. Buck2 features a complete separation of the core and language-specific rules, with increased parallelism, integration with remote execution and virtual file systems, and a redesigned console output. All of these changes are aimed at helping engineers and developers spend less time waiting, and more time iterating on their code.<\/span><\/p>\n<p><span>Thousands of developers at Meta are already using Buck2 and performing millions of builds per day, with builds completing twice as fast as with Buck1. Our own internal analysis has shown that engineers were able to produce meaningfully more code when their builds were executed by Buck2, and we hope the wider industry will also see benefits.<\/span><\/p>\n<h2><span>Why rebuild Buck?<\/span><\/h2>\n<p><span>Build systems stand between a programmer and running their code, so anything we can do to make the experience quicker or more productive directly impacts how effective a developer can be. The goal of Buck2 was to keep what we liked about Buck1 (the core concepts and workflows), draw inspiration from innovations after Buck1 (including <\/span><a href=\"https:\/\/bazel.build\/\" target=\"_blank\" rel=\"noopener\"><span>Bazel<\/span><\/a><span>, <\/span><a href=\"http:\/\/adapton.org\/\" target=\"_blank\" rel=\"noopener\"><span>Adapton<\/span><\/a><span>, and <\/span><a href=\"https:\/\/shakebuild.com\/\" target=\"_blank\" rel=\"noopener\"><span>Shake<\/span><\/a><span>), and focus on speed and enabling new experiences.<\/span><\/p>\n<p><span>Buck2\u2019s design is based on the following principles:<\/span><\/p>\n<p>The core build system has no knowledge of any language-specific rules.<span> Having the rules separated from the core means that the rules are easier to change and understand. The core of Buck2 is written in Rust, and its language rules (like how to build C++) are written in <\/span><a href=\"https:\/\/github.com\/facebookexperimental\/starlark-rust\" target=\"_blank\" rel=\"noopener\"><span>Starlark<\/span><\/a><span>. This separation is in contrast to Buck1 (where all rules are written in the core) and Bazel (where C++\/Java are written in the core). <\/span><br \/>\nThe build system is powered by a <a href=\"https:\/\/ndmitchell.com\/#build_01_jan_2022\" target=\"_blank\" rel=\"noopener\">single incremental dependency graph<\/a><span>, avoiding any phases (in contrast to Buck1 or Bazel). This decision eliminates many types of bugs and increases parallelism.<\/span><br \/>\nThe rules API is designed to contain advanced features for performance<span>, along with dynamic (or monadic) dependency features for expressibility. At the same time, these features are carefully restricted to ensure other properties (for example, fast queries or hermeticity) are not harmed.<\/span><br \/>\nThe open source release is almost identical to our internal version<span>. The only pieces swapped out are the toolchains (which point at the internal copies of our compilers) and remote execution (which points at our internal servers) <\/span><span>\u2014 <\/span><span>both have open source alternatives supplied. We are also releasing all the rules exactly as they are used internally. Furthermore, we have separated some of the logical components into separate crates (e.g. <\/span><a href=\"https:\/\/developers.facebook.com\/blog\/post\/2021\/04\/08\/rust-starlark-library\/\" target=\"_blank\" rel=\"noopener\"><span>Starlark<\/span><\/a><span>, <\/span><a href=\"https:\/\/developers.facebook.com\/blog\/post\/2022\/07\/21\/superconsole\/\" target=\"_blank\" rel=\"noopener\"><span>Superconsole<\/span><\/a><span>, <\/span><a href=\"https:\/\/github.com\/facebookexperimental\/allocative\" target=\"_blank\" rel=\"noopener\"><span>Allocative<\/span><\/a><span>, <\/span><a href=\"https:\/\/developers.facebook.com\/blog\/post\/2021\/06\/29\/rust-nibbles-gazebo-prelude\/\" target=\"_blank\" rel=\"noopener\"><span>Gazebo<\/span><\/a><span>) so that they can be used outside Buck2.<\/span><br \/>\nBuck2 is written to integrate with remote execution<span>, with the ability to run actions on remote machines. We use the same <\/span><a href=\"https:\/\/bazel.build\/remote\/rbe\" target=\"_blank\" rel=\"noopener\"><span>API as Bazel<\/span><\/a><span>, and have been testing remote execution with <\/span><a href=\"https:\/\/github.com\/buildbarn\" target=\"_blank\" rel=\"noopener\"><span>Buildbarn<\/span><\/a><span> and <\/span><a href=\"https:\/\/www.engflow.com\/\" target=\"_blank\" rel=\"noopener\"><span>EngFlow<\/span><\/a><span>. While not required (and not really expected for people starting out with the open source version), we are able to efficiently compute recursive digests and send them to remote execution efficiently.<\/span><br \/>\nBuck2 is written to integrate with virtual file systems<span>, where the entire repository is not all checked out, but fetched on demand as the files are accessed. In particular, we support <\/span><a href=\"https:\/\/github.com\/facebook\/sapling\/blob\/main\/eden\/fs\/docs\/Overview.md\" target=\"_blank\" rel=\"noopener\"><span>Sapling-based file systems<\/span><\/a><span>. To integrate well, we watch for file notifications (with <\/span><a href=\"https:\/\/facebook.github.io\/watchman\/\" target=\"_blank\" rel=\"noopener\"><span>Watchman<\/span><\/a><span>) and request both files and file-digests without direct file operations. The benefit is that we can make virtual file systems as fast as a full checkout, but with the benefits of much faster checkout and much lower disk usage.<\/span><\/p>\n<p><span>The key takeaway from all these improvements is that we have designed Buck2 to be fast. In real world usage, depending on the build, Buck2 is significantly faster than Buck1. If there are no source code changes, Buck2 is almost instant on subsequent builds. If there is a lot of work to do, Buck2 starts executing faster and has greater parallelism. This increase in speed is both a consequence of many of the factors above, but also care and attention.<\/span><\/p>\n<h2><span>The user view<\/span><\/h2>\n<p><span>For end users, Buck2 works mostly the same as Buck1 (which, to a first approximation, is fairly similar to Bazel). A user defines targets in a <\/span><span>BUCK<\/span><span> file:<\/span><\/p>\n<p>rust_binary(<br \/>\n\u00a0\u00a0\u00a0\u00a0name = \u201cmy_binary\u201d,<br \/>\n\u00a0\u00a0\u00a0\u00a0srcs = [\u201cmain.rs\u201d],<br \/>\n\u00a0\u00a0\u00a0\u00a0deps = [\u201c:my_library\u201d],<br \/>\n)<\/p>\n<p><span>A user can then build with <\/span><span>buck2 build <span>\/\/:my_binary<\/span><\/span><span>. The value <\/span><span>main.rs<\/span><span> is a source file, and <\/span><span>:my_library<\/span><span> is a dependency defined in the same <\/span><span>BUCK<\/span><span> file. It\u2019s worth noting that Buck2 is mostly compatible with the <\/span><span>BUCK<\/span><span> files of Buck1.\u00a0<\/span><\/p>\n<p><span>As well as the increase in speed, there are two major additional user-visible differences compared to Buck1.<\/span><\/p>\n<p><span>First, the console output has been redesigned on top of the <\/span><a href=\"https:\/\/developers.facebook.com\/blog\/post\/2022\/07\/21\/superconsole\/\" target=\"_blank\" rel=\"noopener\"><span>Superconsole library<\/span><\/a><span>, which we specifically developed for Buck2. The console shows a few more details and feels a lot nicer to use:<\/span><\/p>\n\n<p><span>Second, there is a persistent daemon that maintains a single dependency graph. When you change a <\/span><span>BUCK<\/span><span> file, a dependency, or a source file, we invalidate the appropriate things on the dependency graph, then request the output artifacts per the command line. In Buck1 there are multiple distinct dependency graphs, which result in phases like target graph construction, action graph construction, and then action graph execution. There are also some operations that aren\u2019t performed on the graph. If certain things change in Buck1, then entire graphs are thrown away, rather than the minimum pieces being invalidated. With a single dependency graph, Buck2 is simpler, avoids more redundant work, and avoids explicit phases. Everything on the dependency graph has a key (how it is identified) and a value, along with a function to compute the value from the key and other related keys (following the model in the paper, \u201c<\/span><a href=\"https:\/\/ndmitchell.com\/#shake_21_apr_2020\" target=\"_blank\" rel=\"noopener\"><span>Build Systems a la Carte\u201d<\/span><\/a><span>).<\/span><\/p>\n<h2><span>The rule author view<\/span><\/h2>\n<p><span>While the user model follows Buck1 very closely, the approach for rules is completely different. There are lots of rules in Buck, for example <\/span><span>rust_binary<\/span><span> used above. While a rule in Buck1 was a Java class, baked into Buck1, a rule in Buck2 is entirely decoupled. Buck2 also ships with a \u201cprelude\u201d of rules that implement most of the Buck1 rules.\u00a0<\/span><\/p>\n<p><span>Buck1 rules were tuned over time, had lots of performance optimizations and powerful features like graph traversal, but those rules were also expected to obey a lot of complex invariants<\/span><span>\u2014<\/span><span>sometimes breaking those rules. For Buck2, the rule API is entirely in Starlark, which forced us to abstract those features as generically reusable APIs, aiming to make them safe, expressive, and performant<\/span><span>\u2014<\/span><span>a tricky balance. We\u2019ll touch on two such examples.<\/span><\/p>\n<h3><span>OCaml dependencies<\/span><\/h3>\n<p><span>The dependency structure of the OCaml library is hard to express in Buck1. An OCaml library consists of a number of OCaml files. These must be compiled in dependency order<\/span><span>\u2014<\/span><span>so if <\/span><span>A.ml<\/span><span> uses <\/span><span>B.ml<\/span><span>, you must compile <\/span><span>B.ml<\/span><span> first. Bazel requires the dependency of <\/span><span>A.ml<\/span><span> on <\/span><span>B.ml<\/span><span> to be written explicitly in the <\/span><span>BUCK<\/span><span> file. Buck1 and Buck2 both leave that internal dependency implicit and run the tool <\/span><span>ocamldep<\/span><span> to infer it, which requires less maintenance as the structure changes.\u00a0 What Buck1 did is run <\/span><span>ocamldep<\/span><span> just after parsing the <\/span><span>BUCK<\/span><span> file, which wasn\u2019t really allowed, and it didn\u2019t track dependencies, so if you changed the imports too much Buck1 gave spurious compilation failures. With Buck2, we can use the <\/span><a href=\"https:\/\/buck2.build\/docs\/rule_authors\/dynamic_dependencies\/\" target=\"_blank\" rel=\"noopener\"><span>new primitive <\/span><span>dynamic_output<\/span><\/a><span>, which lets you run a command, read the output of the file, then wire up the rest of the graph<\/span><span>\u2014<\/span><span>putting in the correct dependencies between the <\/span><span>.ml<\/span><span> files automatically.<\/span><\/p>\n<h3><span>C++ link dependencies<\/span><\/h3>\n<p><span>Consider the C++ linking model:<\/span> <span>To produce a library, you usually need to link together its build output, along with the transitive closure of the build output of its dependencies. If you simply duplicate the set of dependencies at each layer as you move up the graph, you end up with <\/span><span>O(n<\/span><span>2<\/span><span>)<\/span><span> memory usage. In Buck1, there was custom code in many rules to capture this pattern, relying on the ability to share Java values in memory and for the dependencies to be represented in place within the rule structure (as there was no reified dependency graph). In Buck2, there are much stronger abstraction boundaries, so such reuse has to be made more explicit. Therefore, we <\/span><a href=\"https:\/\/buck2.build\/docs\/rule_authors\/transitive_sets\/\" target=\"_blank\" rel=\"noopener\"><span>introduced transitive-sets (tsets)<\/span><\/a><span> to capture this pattern of sets representing a transitive closure. By making tsets more abstract, we were also able to wire the tset directly into the underlying dependency graph, meaning this representation is efficient in both memory and computation time.<\/span><\/p>\n<h2><span>Try Buck2 now<\/span><\/h2>\n<p><span>We\u2019re keen for people to give Buck2 a try, and we would be happy to hear any feedback (<\/span><a href=\"https:\/\/github.com\/facebookincubator\/buck2\/issues\" target=\"_blank\" rel=\"noopener\"><span>GitHub issues<\/span><\/a><span> are the best way). We expect Buck2 will be most interesting to moderately sized multi-language projects. Visit the <\/span><a href=\"https:\/\/buck2.build\/docs\/getting_started\/\" target=\"_blank\" rel=\"noopener\"><span>Buck2 getting started page<\/span><\/a><span> for more information.<\/span><\/p>\n<p>The post <a href=\"https:\/\/engineering.fb.com\/2023\/04\/06\/open-source\/buck2-open-source-large-scale-build-system\/\">Build faster with Buck2: Our open source build system<\/a> appeared first on <a href=\"https:\/\/engineering.fb.com\/\">Engineering at Meta<\/a>.<\/p>\n<p>Engineering at Meta<\/p>","protected":false},"excerpt":{"rendered":"<p>Buck2, our new open source, large-scale build system, is now available on GitHub. Buck2 is an extensible and performant build system written in Rust and designed to make your build experience faster and more efficient.\u00a0 In our internal tests at Meta, we observed that Buck2 completed builds 2x as fast as Buck1. Buck2, Meta\u2019s open&hellip; <a class=\"more-link\" href=\"https:\/\/fde.cat\/index.php\/2023\/04\/06\/build-faster-with-buck2-our-open-source-build-system\/\">Continue reading <span class=\"screen-reader-text\">Build faster with Buck2: Our open source build system<\/span><\/a><\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":""},"categories":[7],"tags":[],"class_list":["post-697","post","type-post","status-publish","format-standard","hentry","category-technology","entry"],"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":776,"url":"https:\/\/fde.cat\/index.php\/2023\/10\/23\/5-things-you-didnt-know-about-buck2\/","url_meta":{"origin":697,"position":0},"title":"5 Things you didn\u2019t know about Buck2","date":"October 23, 2023","format":false,"excerpt":"Meta has a very large monorepo, with many\u00a0 different programming languages. To optimize build and performance, we developed our own build system called Buck, which was first open-sourced in 2013.\u00a0 Buck2 is the recently open-sourced successor. In our internal tests at Meta, we observed that Buck2 completed builds approximately 2x\u2026","rel":"","context":"In &quot;Technology&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":728,"url":"https:\/\/fde.cat\/index.php\/2023\/06\/27\/meta-developer-tools-working-at-scale\/","url_meta":{"origin":697,"position":1},"title":"Meta developer tools: Working at scale","date":"June 27, 2023","format":false,"excerpt":"Every day, thousands of developers at Meta are working in repositories with millions of files. Those developers need tools that help them at every stage of the workflow while working at extreme scale. In this article we\u2019ll go through a few of the tools in the development process. And, as\u2026","rel":"","context":"In &quot;Technology&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":820,"url":"https:\/\/fde.cat\/index.php\/2024\/02\/06\/dotslash-simplified-executable-deployment\/","url_meta":{"origin":697,"position":2},"title":"DotSlash: Simplified executable deployment","date":"February 6, 2024","format":false,"excerpt":"We\u2019ve open sourced DotSlash, a tool that makes large executables available in source control with a negligible impact on repository size, thus avoiding I\/O-heavy clone operations. With DotSlash, a set of platform-specific executables is replaced with a single script containing descriptors for the supported platforms. DotSlash handles transparently fetching, decompressing,\u2026","rel":"","context":"In &quot;Technology&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":818,"url":"https:\/\/fde.cat\/index.php\/2024\/01\/29\/improving-machine-learning-iteration-speed-with-faster-application-build-and-packaging\/","url_meta":{"origin":697,"position":3},"title":"Improving machine learning iteration speed with faster application build and packaging","date":"January 29, 2024","format":false,"excerpt":"Slow build times and inefficiencies in packaging and distributing execution files were costing our ML\/AI engineers a significant amount of time while working on our training stack. By addressing these issues head-on, we were able to reduce this overhead by double-digit percentages.\u00a0 In the fast-paced world of AI\/ML development, it\u2019s\u2026","rel":"","context":"In &quot;Technology&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":651,"url":"https:\/\/fde.cat\/index.php\/2022\/11\/15\/sapling-source-control-thats-user-friendly-and-scalable\/","url_meta":{"origin":697,"position":4},"title":"Sapling: Source control that\u2019s user-friendly and scalable","date":"November 15, 2022","format":false,"excerpt":"Sapling is a new Git-compatible source control client. Sapling emphasizes usability while also scaling to the largest repositories in the world. ReviewStack is a demonstration code review UI for GitHub pull requests that integrates with Sapling to make reviewing stacks of commits easy. You can get started using Sapling today.\u00a0\u2026","rel":"","context":"In &quot;Technology&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":836,"url":"https:\/\/fde.cat\/index.php\/2024\/03\/12\/building-metas-genai-infrastructure\/","url_meta":{"origin":697,"position":5},"title":"Building Meta\u2019s GenAI Infrastructure","date":"March 12, 2024","format":false,"excerpt":"Marking a major investment in Meta\u2019s AI future, we are announcing two 24k GPU clusters. We are sharing details on the hardware, network, storage, design, performance, and software that help us extract high throughput and reliability for various AI workloads. We use this cluster design for Llama 3 training. We\u2026","rel":"","context":"In &quot;Technology&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/697","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/comments?post=697"}],"version-history":[{"count":0,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/posts\/697\/revisions"}],"wp:attachment":[{"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/media?parent=697"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/categories?post=697"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fde.cat\/index.php\/wp-json\/wp\/v2\/tags?post=697"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}