Language-Model Probabilistic Programming for Improving Autoformalization via Cycle Consistency and Incremental Type-Checking (LAFI 2026)

Sun 11 - Sat 17 January 2026 Rennes, France

Who

Mauricio Barba da Costa, Fabian Zaiser, Katherine Collins, Romir Patel, Timothy O'Donnell, Alexander K. Lew, Joshua B. Tenenbaum, Vikash K. Mansinghka, Cameron Freer

Track

LAFI 2026

Time Zone

The program is currently displayed in (GMT+01:00) Brussels, Copenhagen, Madrid, Paris.

Use conference time zone: (GMT+01:00) Brussels, Copenhagen, Madrid, ParisSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Sun 11 Jan 2026 16:24 - 16:34 at Salle 13 - Fourth Session Chair(s): Guillaume Baudart

Abstract

Autoformalization, the task of translating natural-language mathematics into a formal language such as Lean, has the potential to change the nature of mathematical discovery — providing mathematicians more certainty via verification during their proof-discovery process and even facilitating automated mathematical reasoning. In recent years, language models have made great strides towards autoformalization, but the task remains challenging both because of the rigidness of the target formal languages and the inherent uncertainty in informal texts. We propose a method for autoformalization via language-model probabilistic programming. Using tools for constrained generation from language models (LMs), we probabilistically steer LMs by incorporating two correctness signals: (i) incremental type-checking in Lean to rule out generations with no valid completion, and (ii) cycle-consistency whereby a backtranslation of the formalized statement is constrained to be similar to the original informal statement. We demonstrate that both of these signals can improve autoformalization quality on the miniF2F and LeanEuclid Book datasets, while requiring fewer tokens and shorter runtime.

Mauricio Barba da Costa

MIT

Fabian Zaiser

MIT

Katherine Collins

MIT

Romir Patel

MIT

Timothy O'Donnell

Alexander K. Lew

Yale University

United States

Joshua B. Tenenbaum

Massachusetts Institute of Technology

United States

Vikash K. Mansinghka

Massachusetts Institute of Technology

United States

Cameron Freer

Massachusetts Institute of Technology

United States

Time Zone

The program is currently displayed in (GMT+01:00) Brussels, Copenhagen, Madrid, Paris.

Use conference time zone: (GMT+01:00) Brussels, Copenhagen, Madrid, ParisSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Sun 11 Jan
Displayed time zone: Brussels, Copenhagen, Madrid, Paris change

16:00 - 18:00	Fourth SessionLAFI at Salle 13 Chair(s): Guillaume Baudart Inria

16:00 10m Talk		Multi-Agent Systems for Traceable Bayesian Workflow LAFI Xianda Sun University of Cambridge, Andrew D. Gordon Cogna and University of Edinburgh, Hong Ge University of Cambridge File Attached
16:12 10m Talk		Grammar-Constrained LLM Generation for Reliable and Efficient Probabilistic Program Synthesis LAFI Madhav Kanda University of Illinois Urbana-Champaign, Shubham Ugare Meta, Sasa Misailovic University of Illinois at Urbana-Champaign
16:24 10m Talk		Language-Model Probabilistic Programming for Improving Autoformalization via Cycle Consistency and Incremental Type-Checking LAFI Mauricio Barba da Costa MIT, Fabian Zaiser MIT, Katherine Collins MIT, Romir Patel MIT, Timothy O'Donnell , Alexander K. Lew Yale University, Joshua B. Tenenbaum Massachusetts Institute of Technology, Vikash K. Mansinghka Massachusetts Institute of Technology, Cameron Freer Massachusetts Institute of Technology
16:35 80m Poster		Poster Session LAFI