LLVM Developers' MeetingTable of ContentsAboutProgramTalk AbstractsOpenMP Support in Clang/LLVM: Status Update and Future DirectionsAlive: Provably Correct InstCombine OptimizationsLLVM Stable ReleasesSupporting Precise Relocating Garbage Collection in LLVMSource Code Analysis for Security through LLVMFTL: WebKit's LLVM based JITVirtual Ghost: Using LLVM to Protect Applications from a Compromised Operating SystemWhat does it take to make LLVM as performant as GCCBlowing up the Atomic BarrierThe LLVM Pass Manager, Part 2Supporting Vector Programming on a Bi-Endian Processor ArchitectureIntrinsics, Metadata and Attributes: Now, more than ever!Implementing Data Layout Optimizations in LLVM FrameworkFuzzing Clang to Find ABI BugsA closer look at ARM code qualityImplementation of global instruction scheduling in LLVM infrastructureSkip the FFI: Embedding Clang for C InteroperabilityCustom Hardware State-Machines and Datapaths: Using LLVM to Generate FPGA AcceleratorsControlling Virtual Register Pressure in LLVM Middle-EndIndexing Large, Mixed-Language CodebasesArchitecture Specific Code Generation and Function MultiversioningFrappé: Using Clang to Query and Visualize Large CodebasesLightning Talk AbstractsAutomatic Loop Diagonalization with LLVMPBQP register allocationPBQP register allocation in the wildLink-Time Optimization on PlayStation(R)4Software Visualizer (SWViz): A tool for visually exploring Linux kernel with ClangLLVM for Interactive Modeling and High Performance SimulationAddressSanitizer for WindowsTutorial AbstractsBuilding an LLVM BackendAdding and Optimizing a Subtarget for MISchedulerDebug Info TutorialBoF AbstractsJIT Support in LLVMPerformance tracking & benchmarking infrastructureDebug Info BOFImproving LLVM for remote test executionFuture directions and features for LLDBGPU Implementers BoFLLVM Inliner ImprovementsLTOLldPoster AbstractsLLVM for Interactive Modeling and High Performance SimulationISPC: clang-based front-endA C++ ABI Test SuiteSoftware Visualizer (SWViz): A tool for visually exploring Linux kernel with ClangMachine Guided Compilation and Compiling to Minimize Energy UsageTranslating Java into LLVM IR to Detect Security VulnerabilitiesIntel® AVX-512 architecture evolution and support in Clang/LLVMmcsemaCarte++: A LLVM-Based Compiler Targeting FPGAsLogisticsConference Dates: October 28-29, 2014Location: DoubleTree by Hilton, San Jose, CAEvent Site: https://llvm.org/devmtg/2014-10/AboutThe LLVM Foundation announces the eighth annual bay area LLVM Developers’ Meeting will be held October 28th and 29th in San Jose, CA.This year the conference will be 2 full days that include technical talks, BoFs, hacker’s lab, tutorials, and a poster session. Attendance will be capped at 300.The meeting serves as a forum for LLVM, Clang, LLDB, and other LLVM project developers and users to get acquainted, learn how LLVM is used, and exchange ideas about LLVM and its (potential) applications. More broadly, we believe the event will be of particular interest to the following people:Active developers of projects in the LLVM Umbrella (LLVM core, Clang, LLDB, libc++, compiler_rt, klee, dragonegg, lld, etc).Anyone interested in using these as part of another project.Compiler, programming language, and runtime enthusiasts.Those interested in using compiler and toolchain technology in novel and interesting ways.ProgramTalk AbstractsOpenMP Support in Clang/LLVM: Status Update and Future DirectionsAlexey Bataev - Intel, Zinovy Nis - Intel [Slides] [Video]OpenMP is a well-known and widely used API for shared-memory parallelism. Support for OpenMP in Clang/LLVM compiler is currently under development. In this talk, we will present current status of OpenMP support, what is done and what remains to be done, technical details behind OpenMP implementation. Also, we will elaborate on accelerators and pragma-assisted SIMD vectorization, introduced in the latest 4.0 edition of the OpenMP standard.Alive: Provably Correct InstCombine OptimizationsNuno Lopes - Microsoft Research, David Menendez - Rutgers University, Santosh Nagarakatte - Rutgers University, John Regehr - University of Utah [Slides] [Video]Optimizations are hard to get right. Even seemingly innocuous transformations in InstCombine can miss important corner cases. With Alive, you can specify peephole optimizations in a friendly, LLVM-like language, automatically determine their correctness, and generate the corresponding C++ code.LLVM Stable ReleasesTom Stellard - Advanced Micro Devices [Slides] [Video]This talk will cover LLVM stable releases, how they work, who uses them, and how we can make them better.Supporting Precise Relocating Garbage Collection in LLVMPhilip Reames - Azul Systems, Sanjoy Das - Azul Systems [Slides] [Video]Generating efficient code that is compatible with common high performance garbage collector implementations will strengthen LLVM's ability to support languages with managed runtime environments. To support this common use case, we have built and are in the process of contributing a safepoint insertion pass which can rewrite optimized IR into a form which respects the invariants required by a fully relocating garbage collector, and a set of intrinsics which enable work towards efficient lowering of safepoints.Source Code Analysis for Security through LLVMLu Zhao - HP Fortify [Slides] [Video]We added a new debug mode for the Clang compiler which emits extra metadata in the LLVM bitcode. The metadata has turned the bitcode into a useful vehicle from which an intermediate representation for secure source code analysis can be derived. We have used this approach to find security vulnerabilities in the Objective-C source code.FTL: WebKit's LLVM based JITAndrew Trick - Apple, Juergen Ributzka - Apple [Slides] [Video]FTL is the fourth-tier LLVM JIT that powers JavaScript in WebKit. We will talk about our experiences using LLVM to build this high-performance JIT. We will explain the motivation for new LLVM features, including patchpoints and a new form of stack maps, and will share our vision on future work and the direction we would like LLVM move to become a better platform for JIT clients.Virtual Ghost: Using LLVM to Protect Applications from a Compromised Operating SystemJohn Criswell - University of Rochester [Slides] [Video]This talk presents an LLVM-based system named Virtual Ghost that protects applications from a compromised operating system kernel. Virtual Ghost uses compiler instrumentation to protect application data from spying and corruption from the operating system kernel. It also uses an extended version of the LLVM instruction set to restrict how the operating system kernel can interact with the hardware, preventing the operating system kernel from using hardware configuration to corrupt application control-flow or to bypass the application data protection guarantees.What does it take to make LLVM as performant as GCCJames Molloy - ARM, Ana Pazos - QuIC, Yin Ma - QuIC [Slides] [Video]For the past 7 months Qualcomm and ARM have jointly been analyzing and improving performance for the AArch64 architecture in LLVM, based on a differential analysis against GCC. This talk aims to provide information on the areas that we're currently lacking compared to GCC, along with the progress that we've made so far.Blowing up the Atomic BarrierRobin Morisset - Google, JF Bastien - Google [Slides] [Video]Atomics in C11 and C++11 let the programmer express the guarantees needed for racy accesses in lock-free code, in theory bringing a zero-cost abstraction for parallelism to the language. This talk will showcase how you can use atomics today and where the abstraction breaks down. We'll focus on LLVM's recent improvements for atomics that provide significant performance gains on ARMv7, Power and x86. Finally we'll discuss some extremely non-intuitive behaviors of atomics, how atomics in C++ may evolve, and how it may impact LLVM.The LLVM Pass Manager, Part 2Chandler Carruth - Google [Slides] [Video]I will present a new design and implementation of the LLVM pass manager that is currently being developed in the tree. I will cover how it differs from the previous implementation and how those differences allow it to solve many of the long-standing limitations of the current implementation. I will discuss in depth how the new system supports richer analysis dependencies, flexible composition of function-level analyses and module- or call-graph-level transformations.Supporting Vector Programming on a Bi-Endian Processor ArchitectureBill Schmidt - IBM, Michael Gschwind - IBM [Slides] [Video]The POWER instruction set architecture is designed to support both big-endian and little-endian memory models. However, many of the instructions designed for vector support assume that vector elements in registers appear in big endian order, that is, with the lowest-numbered vector element in the most significant portion of the register. This talk will outline some of the issues faced in designing a sensible vector programming model on a bi-endian architecture with a big-endian bias, and how we've addressed them.Intrinsics, Metadata and Attributes: Now, more than ever!Hal Finkel - Argonne National Laboratory [Slides] [Video]Over the past year, LLVM has grown several new ways to communicate important information to the optimizer: An @llvm.assume intrinsic function to provided additional truths, scoped-noalias metadata to provided explicit pointer aliasing sets, and parameter attributes that specify pointer alignment, dereferenceability, and more. I'll explain the semantics of many of these new features, their intended uses, and a few ways they shouldn't be used.Implementing Data Layout Optimizations in LLVM FrameworkPrashantha NR - Compiler Tree Technologies, Vikram TV - Compiler Tree Technologies, Vaivaswatha N - Compiler Tree Technologies [Slides] [Video]Modern server workloads are limited by memory bandwidth. For regular accesses like loops, people change the loop iterations to change the access pattern; thereby gaining locality. Another way to alleviate the memory bottleneck is to change the data layout organization for better locality. In this talk we will speak about memory layout optimizations like Structure Splitting, Instance Interleaving, Struct Array copy, Array Remapping in LLVM compiler framework.Fuzzing Clang to Find ABI BugsDavid Majnemer - Google [Slides] [Video]Correctly implementing C++ is very hard, bugs can arise from incredibly subtle interactions of different language features. In this talk, we will discuss how we used fuzzing to dramatically increase the reliability of Clang's ABI-specific code.A closer look at ARM code qualityTilmann Scheller - Samsung Electronics [Slides] [Video]This talk presents current performance numbers for the SPEC CPU benchmark suites on ARM, comparing the performance of LLVM and GCC, with the main focus on the SPEC CPU integer benchmarks. To dive a little bit deeper, we will also have a closer look at the generated assembly code of selected benchmarks where LLVM is performing worse than GCC. We will use the results of this performance analysis to point out potential code generation opportunities for LLVM.Implementation of global instruction scheduling in LLVM infrastructureSergei Larin - QuIC, Aditya Kumar - QuIC [Slides] [Video]Discuss perspectives and tradeoffs in implementation of global instruction scheduling and support for it in the LLVM infrastructure. Present and discuss relative QuIC experience.Skip the FFI: Embedding Clang for C InteroperabilityJordan Rose - Apple, John McCall - Apple [Slides] [Video]Most languages that aren't a superset of C provide a Foreign Function Interface (FFI) that allows one to interface with existing C libraries. FFIs are often an afterthought, requiring manual or source-to-source translation from C header files to a subset of the target language, resulting in complicated build processes, frequent manual tweaking, and numerous implementation challenges. This talk will discuss an alternative approach that embeds Clang into an LLVM-based compiler front end to provide C compatibility without the traditional FFI.Custom Hardware State-Machines and Datapaths: Using LLVM to Generate FPGA AcceleratorsAlan Baker - Altera Corporation [Slides] [Video]Altera Corporation's OpenCL compiler uses LLVM to generate FPGA accelerators. In order to generate efficient hardware, many transformations were implemented. The key transformations will be discussed, emphasizing the FPGA vs CPU architectural differences that motivate them.Controlling Virtual Register Pressure in LLVM Middle-EndIvan Baev - QuIC [Slides] [Video]Enabling new compiler optimizations or compiling at higher optimization levels do not necessarily improve performance. One common reason is the increased register pressure that results in an increased amount of spill code. We analyze several LLVM target-independent optimizations with respect to register pressure. We then describe algorithms for controlling register pressure in LICM and GVN and report their positive impact on run-time performance.Indexing Large, Mixed-Language CodebasesJames Dennett - Google, Luke Zarko - Google [Slides] [Video]The Kythe project aims to establish open data formats and protocols for interoperable developer tools. In this talk, we will introduce the Kythe model as it applies to C++14, concentrating on features required for generating cross-references. In the process, we will discuss how the Clang front-end was instrumental in developing an indexing tool that produces Kythe data describing C++ source code.Architecture Specific Code Generation and Function MultiversioningEric Christopher - Google [Slides] [Video]A talk on microarchitecture dependent optimization and code generation for individual functions, the changes that have been necessary to enable it, and function multiversioning as the next step.Frappé: Using Clang to Query and Visualize Large CodebasesNathan Hawes - Oracle, Ben Barham - Oracle [Slides] [Video]Frappé is a new tool to support developers with a range of code comprehension queries in multi-million line codebases, from 'Does function X or something it calls write to global variable Y?' to 'How much code could be affected if I change this macro?'. Results are overlaid on a visualisation of the code based on a cartographic map, where the continent/country/state hierarchy corresponds to the code equivalent: high-level architectural components down to individual files and functions.Lightning Talk AbstractsAutomatic Loop Diagonalization with LLVMVedant Kumar - UC Berkeley [Slides]The effects of some loops can be captured by matrices. This talk shows how to take advantage of this fact to optimize away entire loops.PBQP register allocationArnaud de Grandmaison - ARM, Lang Hames - Apple [Slides]The PBQP register allocator has been part of the LLVM source code for a long time. This talk will present a high level overview of the PBQP allocator principles, as well as its status and future work.PBQP register allocation in the wildArnaud de Grandmaison - ARM, Lang Hames - Apple [Slides]This talk will present how the PBQP is used with 2 real world processors and report some performance numbers.Link-Time Optimization on PlayStation(R)4Yunzhong Gao - Sony Computer Entertainment America [Slides]As a follow-up to last year's presentation, this is a talk on our experience implementing LTO on PlayStation(R)4's proprietary linker, including some of the challenges and results we see in the games.Software Visualizer (SWViz): A tool for visually exploring Linux kernel with ClangHarsh Vardhan Dwivedi - QuIC [Slides]SWViz is a tool to explore the call graph of a program. It's chief advantage being able to leverage the linker and compiler (Clang) to generate accurate call-graphs. Through use of Swviz one can quickly gain an understanding of how a program is working. This leads to a massive cutdown in number of hours spent understanding the call-flow of any mature codebase.LLVM for Interactive Modeling and High Performance SimulationPeng Cheng - The MathWorks, Inc. [Slides]At MathWorks, LLVM has been used to create multi-thread JIT engines for interactive modeling and high performance simulation. This talk will present why and how LLVM is used to implement the JIT engines as well as some challenges.AddressSanitizer for WindowsTimur Iskhodzhanov - Google [Slides]A fast memory error detector, now available to try on Windows. Brief overview of the internals and some initial deployment results.Tutorial AbstractsBuilding an LLVM BackendFraser Cormack - Codeplay Software, Pierre-André Saulais - Codeplay Software [Slides] [Video]This talk explains how to get started with building a LLVM backend for a new architecture. It shows how LLVM transforms programs through the back-end compilation pipeline and what needs implementing for a new target. Practical debugging tips, as well as solutions to common issues are given. No LLVM backend experience is needed, but experience with LLVM IR is recommended.Adding and Optimizing a Subtarget for MISchedulerDave Estes - QuIC [Slides] [Video]Tutorial for adding a subtarget to an existing backend for use with the MIScheduler. Will cover TableGen basics insofar as understanding the records used for the MachineSchedModel. Will provide strategies on how to best model some basic machine architectures.Debug Info TutorialEric Christopher - Google, David Blaikie - Google [Slides] [Video]Take a walk through the DWARF debug information format and the llvm APIs that serve as an interface to emitting debug information for your language.BoF AbstractsJIT Support in LLVMLang Hames - AppleA forum for clients and developers of LLVM's JIT infrastructure to discuss APIs, features, and intrinsic support (the stackmap and patchpoint intrinsics).Performance tracking & benchmarking infrastructureKristof Beyls - ARM, Chad Rosier - QuIC, Chris Matthew - Apple, Tobias Grosser - ETH, Renato Golin - LinaroHaving at least some public performance tracking that the majority of the community cares about would make it easier to collaborate for all developers improving the quality of LLVM-generated code. During last year's BoF, we identified some key shortcomings in LNT to be able to produce low-noise performance numbers on the test-suite and most of them have been fixed recently.Debug Info BOFEric Christopher - GoogleThere's been quite a bit of interest in a Debug Info BOF following the one we had last year at the 2013 Developer Conference. This BOF will cover everything from scaling our current handling of debug information, to ongoing work on debug information correctness, and future proposals for DWARF standardization.Improving LLVM for remote test executionBrian Rzycki - Samsung Austin R&D CenterToday's LLVM testing frameworks focus on testing native compilers on the same host where the test is compiled. The goal of this BoF is to examine the components necessary for change and to discuss potential solutions that allow for running tests on remote hosts in a way that best benefits the overall community.Future directions and features for LLDBDeepak Mathews Panickal - Codeplay SoftwareGPU Implementers BoFTom Stellard - AMD, Owen Anderson - AppleLLVM is rapidly gaining popularity as a compilation framework for graphics processors. This Birds of a Feather session will focus on issues of interest to implementers of GPU targets in LLVM. Topics of discussion may include:LLVM Inliner ImprovementsYin Ma - QuIC, Ana Pazos - QuIC, Jiangling Liu - ARMDiscuss the opportunities to extend the LLVM inliner and our work greedy inliner.LTOTony Linthicum - QuIC, Dan Palermo - QuICDiscuss LTO's current state and potential future improvements. Potential topics include, but are not limited to, the following: passing command line arguments to LTO, profile driven LTO optimizations, compile time improvements and performance improvements.LldShankar Easwaran - QuIC, Daniel Stewart - QuICThe lld linker is a solid foundation for a general-purpose linker, as well as a set of libraries for creating linker-like tools. We will talk about the current status of lld, what the shortcomings are for a production linker, and what major areas need to be implemented, such as LTO support, diagnostics, extensibility.Poster AbstractsLLVM for Interactive Modeling and High Performance SimulationPeng Cheng; Nathan Brewton; Dale Martin - The MathWorks, Inc. [Slides]To enable inter-module optimization during interactive modeling for high performance simulation of Simulink models, LLVM based JIT compilers have been developed at MathWorks. These JIT compilers support multiple threads on major platforms, including win32, win64, glnax64, and maci64, and achieve superior running time performance compared with previous shared library based compilers. This talk will present why and how LLVM is used to implement the JIT compilers as well as some challenges.ISPC: clang-based front-endDmitry Babokin - Intel Corporation; James Brodman - Intel Corporation [Slides]ISPC is a C-based language based on the SPMD (single program, multiple data) programming model that generates efficient SIMD code for modern processors without the need for complex analysis and autovectorization. The project uses the LLVM infrastructure for optimization and code generation but originally used a custom front-end. The poster describes our experience with building a clang-based front-end and the engineering problems we have encountered introducing the concept of “varying” types to clang.A C++ ABI Test SuiteSunil Srivastava - Sony Computer Entertainment [Slides]This poster describes the design of an Itanium C++ ABI Test Suite that verifies a compiler’s compliance against the ABI specification to ensure link compatibility.Software Visualizer (SWViz): A tool for visually exploring Linux kernel with ClangHarsh Vardhan Dwivedi - QuIC [Slides]What is SWviz and why should I care about it anyway? SWViz is a tool to explore the call graph of a program. It’s chief advantage being able to leverage the linker and compiler (Clang) to generate accurate call-graphs. Through use of Swviz one can quickly gain an understanding of how a program is working. This leads to a massive cutdown in number of hours spent understanding the call-flow of any mature codebase. We’ll demonstrate use of SWViz with Linux Kernel.Machine Guided Compilation and Compiling to Minimize Energy UsageSimon Cook - Embecosm; Ed Jones - Embecosm [Slides]Today we need compilers to optimize for energy rather than just size or speed. In this poster we present the results of integrating machine learning with LLVM, so the compiler can be trained on which optimizations minimize energy. As a side effect of energy optimization we also see significant performance benefits - nearly doubling performance compared to -O3 in some cases.Translating Java into LLVM IR to Detect Security VulnerabilitiesCristina Cifuentes - Oracle Labs Australia, Oracle; Nathan Keynes - Parfait, Oracle; John Gough - Oracle Labs Australia, Oracle; Diane Corney - Oracle Labs Australia, Oracle; Lin Gao - Parfait, Oracle; Manuel Valdiviezo - Parfait, Oracle; Andrew Gross - Java Security, Oracle [Slides]This poster describes one of several new Java security vulnerabilities and how we reused and extended LLVM’s IR in order to detect such vulnerability. One year later, full support for Java 7 and 8 are in place, along with various analyses that detect Java platform vulnerabilities.Intel® AVX-512 architecture evolution and support in Clang/LLVMZinovy Nis - Intel Corporation; Robert Khasanov - Intel Corporation [Slides]Intel® AVX-512 vector ISA continues to evolve. It has recently been enriched with new groups of instructions operating on different vector lengths and different vector element sizes. We’ll present the current status of new AVX-512 features enabling in CLANG/LLVM and show how these features can be exploited for performance improvements particularly by vectorizer.mcsemaArtem Dinaburg - Trail of Bits, Inc.; Andrew Ruef - Trail of Bits, Inc.; Jay Little - Trail of Bits, Inc. [Slides]mcsema is a framework for analyzing and transforming machine-code programs to LLVM bitcode. It supports translation of x86 machine code, including integer, floating point, and SSE instructions.Carte++: A LLVM-Based Compiler Targeting FPGAsJeffrey Hammes; Lisa Krause; Matthew O’Connor; Jon Steidel - SRC Computers, LLC [Slides]SRC Computers, LLC is presenting a poster on its upcoming Carte++ compiler release showing how the Clang/LLVM infrastructure has provided full language support and a rich set of compiler optimizations upon which to build the Carte++ code generator. The poster will illustrate specific optimizations and compiler challenges unique to SRC's FPGA-based hardware.LogisticsDoubleTree by Hilton San Jose 2050 Gateway Place San Jose, CA 95110You may book their hotel stay by using this link.Rate: $209 plus local taxes for superior double queen or superior king roomsSelf Parking is $19 per day.