Vectorized probabilistic programming languages (PPLs) support high-performance, data-parallel programmable inference. To expose high-level programming models to users, vectorization in these systems hides the concerns of memory management and parallel threading, resulting in black-box parallel compilation and restrictions on custom optimizations. We present a design for GraPPL, a GPU-programmable PPL that exposes high-level features, including traces and probabilistic generative function interfaces, while enabling GPU-programmable control over low-level runtime and memory profiles. GraPPL allows models to be expressed as sequential C++ functions and/or vectorized CUDA GPU kernels that support random choice expressions; GraPPL's template-specialized interpreters transform these expressions into various probabilistic semantics while automatically maintaining coherent execution traces of the probabilistic program across CPU and GPU execution contexts. We demonstrate GraPPL's efficiency on an example of blocked Gibbs sampling on factor graphs, achieving a 3× speedup over JAX-based implementations with equivalent levels of automation and modularity.
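To make the template-specialized interpreter idea concrete, the following is a minimal C++ sketch, not the actual GraPPL API: a model is written once as an ordinary function over random choice expressions, and a template parameter selects its probabilistic semantics. One hypothetical interpreter (`Simulate`) samples each choice and records it in a trace; another (`Score`) replays a fixed trace and accumulates its log-density. All names here are illustrative assumptions.

```cpp
#include <cmath>
#include <random>
#include <string>
#include <unordered_map>

// A trace maps random-choice addresses to sampled values.
using Trace = std::unordered_map<std::string, double>;

// Log-density of a normal distribution, shared by both interpreters.
inline double normal_logpdf(double x, double mu, double sigma) {
    const double log2pi = std::log(2.0 * 3.141592653589793);
    double z = (x - mu) / sigma;
    return -0.5 * z * z - std::log(sigma) - 0.5 * log2pi;
}

// Interpreter #1: sample each random choice, record it in the trace,
// and accumulate the joint log-density of the sampled values.
struct Simulate {
    std::mt19937 rng{42};
    Trace trace;
    double logp = 0.0;
    double normal(const std::string& addr, double mu, double sigma) {
        std::normal_distribution<double> d(mu, sigma);
        double x = d(rng);
        trace[addr] = x;
        logp += normal_logpdf(x, mu, sigma);
        return x;
    }
};

// Interpreter #2: replay a fixed trace (no sampling) and accumulate
// the log-density of the recorded values under the model.
struct Score {
    const Trace& trace;
    double logp = 0.0;
    explicit Score(const Trace& t) : trace(t) {}
    double normal(const std::string& addr, double mu, double sigma) {
        double x = trace.at(addr);
        logp += normal_logpdf(x, mu, sigma);
        return x;
    }
};

// The model is written once; template specialization picks the semantics.
template <typename Interp>
double model(Interp& in) {
    double mu = in.normal("mu", 0.0, 1.0);  // latent location
    double y  = in.normal("y", mu, 0.5);    // downstream choice
    return y;
}
```

Because both interpreters share one addressing scheme for random choices, a trace produced by `Simulate` can be handed directly to `Score`, which is the same coherence property the abstract describes across CPU and GPU execution contexts.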