LLVM Developers' Meeting

Table of Contents

About
Program
Logistics

Conference Dates: October 28-29, 2014
Location: DoubleTree by Hilton, San Jose, CA
Event Site: https://llvm.org/devmtg/2014-10/

About

The LLVM Foundation announces the eighth annual bay area LLVM Developers’ Meeting will be held October 28th and 29th in San Jose, CA.

This year the conference will be 2 full days that include technical talks, BoFs, hacker’s lab, tutorials, and a poster session. Attendance will be capped at 300.

The meeting serves as a forum for LLVM, Clang, LLDB, and other LLVM project developers and users to get acquainted, learn how LLVM is used, and exchange ideas about LLVM and its (potential) applications. More broadly, we believe the event will be of particular interest to the following people:

Active developers of projects in the LLVM Umbrella (LLVM core, Clang, LLDB, libc++, compiler_rt, klee, dragonegg, lld, etc).
Anyone interested in using these as part of another project.
Compiler, programming language, and runtime enthusiasts.
Those interested in using compiler and toolchain technology in novel and interesting ways.

Program

Talk Abstracts

OpenMP Support in Clang/LLVM: Status Update and Future Directions

Alexey Bataev - Intel, Zinovy Nis - Intel [Slides] [Video]
OpenMP is a well-known and widely used API for shared-memory parallelism. Support for OpenMP in Clang/LLVM compiler is currently under development. In this talk, we will present current status of OpenMP support, what is done and what remains to be done, technical details behind OpenMP implementation. Also, we will elaborate on accelerators and pragma-assisted SIMD vectorization, introduced in the latest 4.0 edition of the OpenMP standard.

Alive: Provably Correct InstCombine Optimizations

Nuno Lopes - Microsoft Research, David Menendez - Rutgers University, Santosh Nagarakatte - Rutgers University, John Regehr - University of Utah [Slides] [Video]
Optimizations are hard to get right. Even seemingly innocuous transformations in InstCombine can miss important corner cases. With Alive, you can specify peephole optimizations in a friendly, LLVM-like language, automatically determine their correctness, and generate the corresponding C++ code.

LLVM Stable Releases

Tom Stellard - Advanced Micro Devices [Slides] [Video]
This talk will cover LLVM stable releases, how they work, who uses them, and how we can make them better.

Supporting Precise Relocating Garbage Collection in LLVM

Philip Reames - Azul Systems, Sanjoy Das - Azul Systems [Slides] [Video]
Generating efficient code that is compatible with common high performance garbage collector implementations will strengthen LLVM's ability to support languages with managed runtime environments. To support this common use case, we have built and are in the process of contributing a safepoint insertion pass which can rewrite optimized IR into a form which respects the invariants required by a fully relocating garbage collector, and a set of intrinsics which enable work towards efficient lowering of safepoints.

Source Code Analysis for Security through LLVM

Lu Zhao - HP Fortify [Slides] [Video]
We added a new debug mode for the Clang compiler which emits extra metadata in the LLVM bitcode. The metadata has turned the bitcode into a useful vehicle from which an intermediate representation for secure source code analysis can be derived. We have used this approach to find security vulnerabilities in the Objective-C source code.

FTL: WebKit's LLVM based JIT

Andrew Trick - Apple, Juergen Ributzka - Apple [Slides] [Video]
FTL is the fourth-tier LLVM JIT that powers JavaScript in WebKit. We will talk about our experiences using LLVM to build this high-performance JIT. We will explain the motivation for new LLVM features, including patchpoints and a new form of stack maps, and will share our vision on future work and the direction we would like LLVM move to become a better platform for JIT clients.

Virtual Ghost: Using LLVM to Protect Applications from a Compromised Operating System

John Criswell - University of Rochester [Slides] [Video]
This talk presents an LLVM-based system named Virtual Ghost that protects applications from a compromised operating system kernel. Virtual Ghost uses compiler instrumentation to protect application data from spying and corruption from the operating system kernel. It also uses an extended version of the LLVM instruction set to restrict how the operating system kernel can interact with the hardware, preventing the operating system kernel from using hardware configuration to corrupt application control-flow or to bypass the application data protection guarantees.

What does it take to make LLVM as performant as GCC

James Molloy - ARM, Ana Pazos - QuIC, Yin Ma - QuIC [Slides] [Video]
For the past 7 months Qualcomm and ARM have jointly been analyzing and improving performance for the AArch64 architecture in LLVM, based on a differential analysis against GCC. This talk aims to provide information on the areas that we're currently lacking compared to GCC, along with the progress that we've made so far.

Blowing up the Atomic Barrier

Robin Morisset - Google, JF Bastien - Google [Slides] [Video]
Atomics in C11 and C++11 let the programmer express the guarantees needed for racy accesses in lock-free code, in theory bringing a zero-cost abstraction for parallelism to the language. This talk will showcase how you can use atomics today and where the abstraction breaks down. We'll focus on LLVM's recent improvements for atomics that provide significant performance gains on ARMv7, Power and x86. Finally we'll discuss some extremely non-intuitive behaviors of atomics, how atomics in C++ may evolve, and how it may impact LLVM.

The LLVM Pass Manager, Part 2

Chandler Carruth - Google [Slides] [Video]
I will present a new design and implementation of the LLVM pass manager that is currently being developed in the tree. I will cover how it differs from the previous implementation and how those differences allow it to solve many of the long-standing limitations of the current implementation. I will discuss in depth how the new system supports richer analysis dependencies, flexible composition of function-level analyses and module- or call-graph-level transformations.

Supporting Vector Programming on a Bi-Endian Processor Architecture

Bill Schmidt - IBM, Michael Gschwind - IBM [Slides] [Video]
The POWER instruction set architecture is designed to support both big-endian and little-endian memory models. However, many of the instructions designed for vector support assume that vector elements in registers appear in big endian order, that is, with the lowest-numbered vector element in the most significant portion of the register. This talk will outline some of the issues faced in designing a sensible vector programming model on a bi-endian architecture with a big-endian bias, and how we've addressed them.

Intrinsics, Metadata and Attributes: Now, more than ever!

Hal Finkel - Argonne National Laboratory [Slides] [Video]
Over the past year, LLVM has grown several new ways to communicate important information to the optimizer: An @llvm.assume intrinsic function to provided additional truths, scoped-noalias metadata to provided explicit pointer aliasing sets, and parameter attributes that specify pointer alignment, dereferenceability, and more. I'll explain the semantics of many of these new features, their intended uses, and a few ways they shouldn't be used.

Implementing Data Layout Optimizations in LLVM Framework

Prashantha NR - Compiler Tree Technologies, Vikram TV - Compiler Tree Technologies, Vaivaswatha N - Compiler Tree Technologies [Slides] [Video]
Modern server workloads are limited by memory bandwidth. For regular accesses like loops, people change the loop iterations to change the access pattern; thereby gaining locality. Another way to alleviate the memory bottleneck is to change the data layout organization for better locality. In this talk we will speak about memory layout optimizations like Structure Splitting, Instance Interleaving, Struct Array copy, Array Remapping in LLVM compiler framework.

Fuzzing Clang to Find ABI Bugs

David Majnemer - Google [Slides] [Video]
Correctly implementing C++ is very hard, bugs can arise from incredibly subtle interactions of different language features. In this talk, we will discuss how we used fuzzing to dramatically increase the reliability of Clang's ABI-specific code.

A closer look at ARM code quality

Tilmann Scheller - Samsung Electronics [Slides] [Video]
This talk presents current performance numbers for the SPEC CPU benchmark suites on ARM, comparing the performance of LLVM and GCC, with the main focus on the SPEC CPU integer benchmarks. To dive a little bit deeper, we will also have a closer look at the generated assembly code of selected benchmarks where LLVM is performing worse than GCC. We will use the results of this performance analysis to point out potential code generation opportunities for LLVM.

Implementation of global instruction scheduling in LLVM infrastructure

Sergei Larin - QuIC, Aditya Kumar - QuIC [Slides] [Video]
Discuss perspectives and tradeoffs in implementation of global instruction scheduling and support for it in the LLVM infrastructure. Present and discuss relative QuIC experience.

Skip the FFI: Embedding Clang for C Interoperability

Jordan Rose - Apple, John McCall - Apple [Slides] [Video]
Most languages that aren't a superset of C provide a Foreign Function Interface (FFI) that allows one to interface with existing C libraries. FFIs are often an afterthought, requiring manual or source-to-source translation from C header files to a subset of the target language, resulting in complicated build processes, frequent manual tweaking, and numerous implementation challenges. This talk will discuss an alternative approach that embeds Clang into an LLVM-based compiler front end to provide C compatibility without the traditional FFI.

Custom Hardware State-Machines and Datapaths: Using LLVM to Generate FPGA Accelerators

Alan Baker - Altera Corporation [Slides] [Video]
Altera Corporation's OpenCL compiler uses LLVM to generate FPGA accelerators. In order to generate efficient hardware, many transformations were implemented. The key transformations will be discussed, emphasizing the FPGA vs CPU architectural differences that motivate them.

Controlling Virtual Register Pressure in LLVM Middle-End

Ivan Baev - QuIC [Slides] [Video]
Enabling new compiler optimizations or compiling at higher optimization levels do not necessarily improve performance. One common reason is the increased register pressure that results in an increased amount of spill code. We analyze several LLVM target-independent optimizations with respect to register pressure. We then describe algorithms for controlling register pressure in LICM and GVN and report their positive impact on run-time performance.

Indexing Large, Mixed-Language Codebases

James Dennett - Google, Luke Zarko - Google [Slides] [Video]
The Kythe project aims to establish open data formats and protocols for interoperable developer tools. In this talk, we will introduce the Kythe model as it applies to C++14, concentrating on features required for generating cross-references. In the process, we will discuss how the Clang front-end was instrumental in developing an indexing tool that produces Kythe data describing C++ source code.

Architecture Specific Code Generation and Function Multiversioning

Eric Christopher - Google [Slides] [Video]
A talk on microarchitecture dependent optimization and code generation for individual functions, the changes that have been necessary to enable it, and function multiversioning as the next step.

Frappé: Using Clang to Query and Visualize Large Codebases

Nathan Hawes - Oracle, Ben Barham - Oracle [Slides] [Video]
Frappé is a new tool to support developers with a range of code comprehension queries in multi-million line codebases, from 'Does function X or something it calls write to global variable Y?' to 'How much code could be affected if I change this macro?'. Results are overlaid on a visualisation of the code based on a cartographic map, where the continent/country/state hierarchy corresponds to the code equivalent: high-level architectural components down to individual files and functions.

Lightning Talk Abstracts

Automatic Loop Diagonalization with LLVM

Vedant Kumar - UC Berkeley [Slides]
The effects of some loops can be captured by matrices. This talk shows how to take advantage of this fact to optimize away entire loops.

PBQP register allocation

Arnaud de Grandmaison - ARM, Lang Hames - Apple [Slides]
The PBQP register allocator has been part of the LLVM source code for a long time. This talk will present a high level overview of the PBQP allocator principles, as well as its status and future work.

PBQP register allocation in the wild

Arnaud de Grandmaison - ARM, Lang Hames - Apple [Slides]
This talk will present how the PBQP is used with 2 real world processors and report some performance numbers.

Link-Time Optimization on PlayStation(R)4

Yunzhong Gao - Sony Computer Entertainment America [Slides]
As a follow-up to last year's presentation, this is a talk on our experience implementing LTO on PlayStation(R)4's proprietary linker, including some of the challenges and results we see in the games.

Software Visualizer (SWViz): A tool for visually exploring Linux kernel with Clang

Harsh Vardhan Dwivedi - QuIC [Slides]
SWViz is a tool to explore the call graph of a program. It's chief advantage being able to leverage the linker and compiler (Clang) to generate accurate call-graphs. Through use of Swviz one can quickly gain an understanding of how a program is working. This leads to a massive cutdown in number of hours spent understanding the call-flow of any mature codebase.

LLVM for Interactive Modeling and High Performance Simulation

Peng Cheng - The MathWorks, Inc. [Slides]
At MathWorks, LLVM has been used to create multi-thread JIT engines for interactive modeling and high performance simulation. This talk will present why and how LLVM is used to implement the JIT engines as well as some challenges.

AddressSanitizer for Windows

Timur Iskhodzhanov - Google [Slides]
A fast memory error detector, now available to try on Windows. Brief overview of the internals and some initial deployment results.

Tutorial Abstracts

Building an LLVM Backend

Fraser Cormack - Codeplay Software, Pierre-André Saulais - Codeplay Software [Slides] [Video]
This talk explains how to get started with building a LLVM backend for a new architecture. It shows how LLVM transforms programs through the back-end compilation pipeline and what needs implementing for a new target. Practical debugging tips, as well as solutions to common issues are given. No LLVM backend experience is needed, but experience with LLVM IR is recommended.

Adding and Optimizing a Subtarget for MIScheduler

Dave Estes - QuIC [Slides] [Video]
Tutorial for adding a subtarget to an existing backend for use with the MIScheduler. Will cover TableGen basics insofar as understanding the records used for the MachineSchedModel. Will provide strategies on how to best model some basic machine architectures.

Debug Info Tutorial

Eric Christopher - Google, David Blaikie - Google [Slides] [Video]
Take a walk through the DWARF debug information format and the llvm APIs that serve as an interface to emitting debug information for your language.

BoF Abstracts

JIT Support in LLVM

Lang Hames - Apple
A forum for clients and developers of LLVM's JIT infrastructure to discuss APIs, features, and intrinsic support (the stackmap and patchpoint intrinsics).

Performance tracking & benchmarking infrastructure

Kristof Beyls - ARM, Chad Rosier - QuIC, Chris Matthew - Apple, Tobias Grosser - ETH, Renato Golin - Linaro
Having at least some public performance tracking that the majority of the community cares about would make it easier to collaborate for all developers improving the quality of LLVM-generated code. During last year's BoF, we identified some key shortcomings in LNT to be able to produce low-noise performance numbers on the test-suite and most of them have been fixed recently.

Debug Info BOF

Eric Christopher - Google
There's been quite a bit of interest in a Debug Info BOF following the one we had last year at the 2013 Developer Conference. This BOF will cover everything from scaling our current handling of debug information, to ongoing work on debug information correctness, and future proposals for DWARF standardization.

Improving LLVM for remote test execution

Brian Rzycki - Samsung Austin R&D Center
Today's LLVM testing frameworks focus on testing native compilers on the same host where the test is compiled. The goal of this BoF is to examine the components necessary for change and to discuss potential solutions that allow for running tests on remote hosts in a way that best benefits the overall community.

Future directions and features for LLDB

Deepak Mathews Panickal - Codeplay Software

GPU Implementers BoF

Tom Stellard - AMD, Owen Anderson - Apple
LLVM is rapidly gaining popularity as a compilation framework for graphics processors. This Birds of a Feather session will focus on issues of interest to implementers of GPU targets in LLVM. Topics of discussion may include:

LLVM Inliner Improvements

Yin Ma - QuIC, Ana Pazos - QuIC, Jiangling Liu - ARM
Discuss the opportunities to extend the LLVM inliner and our work greedy inliner.

LTO

Tony Linthicum - QuIC, Dan Palermo - QuIC
Discuss LTO's current state and potential future improvements. Potential topics include, but are not limited to, the following: passing command line arguments to LTO, profile driven LTO optimizations, compile time improvements and performance improvements.

Lld

Shankar Easwaran - QuIC, Daniel Stewart - QuIC
The lld linker is a solid foundation for a general-purpose linker, as well as a set of libraries for creating linker-like tools. We will talk about the current status of lld, what the shortcomings are for a production linker, and what major areas need to be implemented, such as LTO support, diagnostics, extensibility.

Poster Abstracts

LLVM for Interactive Modeling and High Performance Simulation

Peng Cheng; Nathan Brewton; Dale Martin - The MathWorks, Inc. [Slides]
To enable inter-module optimization during interactive modeling for high performance simulation of Simulink models, LLVM based JIT compilers have been developed at MathWorks. These JIT compilers support multiple threads on major platforms, including win32, win64, glnax64, and maci64, and achieve superior running time performance compared with previous shared library based compilers. This talk will present why and how LLVM is used to implement the JIT compilers as well as some challenges.

ISPC: clang-based front-end

Dmitry Babokin - Intel Corporation; James Brodman - Intel Corporation [Slides]
ISPC is a C-based language based on the SPMD (single program, multiple data) programming model that generates efficient SIMD code for modern processors without the need for complex analysis and autovectorization. The project uses the LLVM infrastructure for optimization and code generation but originally used a custom front-end. The poster describes our experience with building a clang-based front-end and the engineering problems we have encountered introducing the concept of “varying” types to clang.

A C++ ABI Test Suite

Sunil Srivastava - Sony Computer Entertainment [Slides]
This poster describes the design of an Itanium C++ ABI Test Suite that verifies a compiler’s compliance against the ABI specification to ensure link compatibility.

Software Visualizer (SWViz): A tool for visually exploring Linux kernel with Clang

Harsh Vardhan Dwivedi - QuIC [Slides]
What is SWviz and why should I care about it anyway? SWViz is a tool to explore the call graph of a program. It’s chief advantage being able to leverage the linker and compiler (Clang) to generate accurate call-graphs. Through use of Swviz one can quickly gain an understanding of how a program is working. This leads to a massive cutdown in number of hours spent understanding the call-flow of any mature codebase. We’ll demonstrate use of SWViz with Linux Kernel.

Machine Guided Compilation and Compiling to Minimize Energy Usage

Simon Cook - Embecosm; Ed Jones - Embecosm [Slides]
Today we need compilers to optimize for energy rather than just size or speed. In this poster we present the results of integrating machine learning with LLVM, so the compiler can be trained on which optimizations minimize energy. As a side effect of energy optimization we also see significant performance benefits - nearly doubling performance compared to -O3 in some cases.

Translating Java into LLVM IR to Detect Security Vulnerabilities

Cristina Cifuentes - Oracle Labs Australia, Oracle; Nathan Keynes - Parfait, Oracle; John Gough - Oracle Labs Australia, Oracle; Diane Corney - Oracle Labs Australia, Oracle; Lin Gao - Parfait, Oracle; Manuel Valdiviezo - Parfait, Oracle; Andrew Gross - Java Security, Oracle [Slides]
This poster describes one of several new Java security vulnerabilities and how we reused and extended LLVM’s IR in order to detect such vulnerability. One year later, full support for Java 7 and 8 are in place, along with various analyses that detect Java platform vulnerabilities.

Intel® AVX-512 architecture evolution and support in Clang/LLVM

Zinovy Nis - Intel Corporation; Robert Khasanov - Intel Corporation [Slides]
Intel® AVX-512 vector ISA continues to evolve. It has recently been enriched with new groups of instructions operating on different vector lengths and different vector element sizes. We’ll present the current status of new AVX-512 features enabling in CLANG/LLVM and show how these features can be exploited for performance improvements particularly by vectorizer.

mcsema

Artem Dinaburg - Trail of Bits, Inc.; Andrew Ruef - Trail of Bits, Inc.; Jay Little - Trail of Bits, Inc. [Slides]
mcsema is a framework for analyzing and transforming machine-code programs to LLVM bitcode. It supports translation of x86 machine code, including integer, floating point, and SSE instructions.

Carte++: A LLVM-Based Compiler Targeting FPGAs

Jeffrey Hammes; Lisa Krause; Matthew O’Connor; Jon Steidel - SRC Computers, LLC [Slides]
SRC Computers, LLC is presenting a poster on its upcoming Carte++ compiler release showing how the Clang/LLVM infrastructure has provided full language support and a rich set of compiler optimizations upon which to build the Carte++ code generator. The poster will illustrate specific optimizations and compiler challenges unique to SRC's FPGA-based hardware.

Logistics

DoubleTree by Hilton San Jose 2050 Gateway Place San Jose, CA 95110

You may book their hotel stay by using this link.

Rate: $209 plus local taxes for superior double queen or superior king rooms

Self Parking is $19 per day.