Lightweight and Locality-Aware Composition of Black-Box Subroutines (PLDI 2025 - PLDI Research Papers)

Mon 16 - Fri 20 June 2025 Seoul, South Korea

Who

Manya Bansal, Dillon Sharlet, Jonathan Ragan-Kelley, Saman Amarasinghe

Track

PLDI 2025 PLDI Research Papers

Time Zone

The program is currently displayed in (GMT+09:00) Seoul.

Use conference time zone: (GMT+09:00) SeoulSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Wed 18 Jun 2025 16:20 - 16:40 at Orchid - High Performance Computing Chair(s): Charith Mendis

Abstract

Subroutines are essential building blocks in software design: users encapsulate common functionality in libraries and write applications by composing calls to subroutines. Unfortunately, performance may be lost at subroutine boundaries due to reduced locality and increased memory consumption. Operator fusion helps recover the performance lost at composition boundaries. Previous solutions fuse operators by manually rewriting code into monolithic fused subroutines, or by relying on heavy-weight compilers to generate code that performs fusion. Both approaches require a semantic understanding of the entire computation, breaking the decoupling necessary for modularity and reusability of subroutines.

In this work, we attempt to identify the minimal ingredients required to fuse computations, enabling composition of subroutines without sacrificing performance or modularity. We find that, unlike previous approaches that require a semantic understanding of the computation, most opportunities for fusion require understanding only data production and consumption patterns.Exploiting this insight, we add fusion on top of black-box subroutines by proposing a lightweight enrichment of subroutine declarations to expose data-dependence patterns. We implement our approach in a system called Fern, and demonstrate Fern's benefits by showing that it is competitive with state-of-the-art, high-performance libraries with manually fused operators, can fuse across library and domain boundaries for unforeseen workloads, and can deliver speedups of up to $5\times$ over unfused code.

DOI

https://doi.org/10.1145/3729292

Manya Bansal

Massachusetts Institute of Technology

United States

Dillon Sharlet

Google

United States

Jonathan Ragan-Kelley

Massachusetts Institute of Technology

United States

Saman Amarasinghe

Massachusetts Institute of Technology

United States

Time Zone

The program is currently displayed in (GMT+09:00) Seoul.

Use conference time zone: (GMT+09:00) SeoulSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Wed 18 Jun
Displayed time zone: Seoul change

16:00 - 17:20	High Performance ComputingPLDI Research Papers at Orchid Chair(s): Charith Mendis University of Illinois at Urbana-Champaign

16:00 20m Talk		Task-Based Tensor Computations on Modern GPUs PLDI Research Papers Rohan Yadav Stanford University, Michael Garland NVIDIA, Alex Aiken Stanford University, Michael Bauer NVIDIA DOI
16:20 20m Talk		Lightweight and Locality-Aware Composition of Black-Box Subroutines PLDI Research Papers Manya Bansal Massachusetts Institute of Technology, Dillon Sharlet Google, Jonathan Ragan-Kelley Massachusetts Institute of Technology, Saman Amarasinghe Massachusetts Institute of Technology DOI
16:40 20m Talk		Modular Construction and Optimization of the UZP Sparse Format for SpMV on CPUs PLDI Research Papers Alonso Rodriguez Universidade da Coruña, Santoshkumar T. Tongli Colorado State University, Emily Tucker Colorado State University, Louis-Noël Pouchet Colorado State University, Gabriel Rodríguez Universidade da Coruña, Juan Tourino Universidade da Coruña DOI
17:00 20m Talk		Dynamic Robustness Verification against Weak MemoryRemote PLDI Research Papers Roy Margalit Tel Aviv University, Michalis Kokologiannakis ETH Zurich, Shachar Itzhaky Technion, Ori Lahav Tel Aviv University DOI