logo
today local_bar
Poster Session 1 · Wednesday, December 3, 2025 11:00 AM → 2:00 PM
#104

Bootstrapping Hierarchical Autoregressive Formal Reasoner with Chain-of-Proxy-Autoformalization

NeurIPS Project Page OpenReview

Abstract

Deductive formal problem-solving (D-FPS) enables process-verified, human-aligned problem-solving by implementing deductive solving processes within formal theorem proving (FTP) environments. However, current methods fail to address the misalignment between informal and formal reasoning granularity and suffer from inefficiency due to backtracking and error propagation. Moreover, the extreme scarcity of formal problem-solution pairs further hinders progress.
For the first gap, we propose HAR (Hierarchical Autoregressive Formal Reasoner), a novel reasoning pipeline. HAR decouples informal-aligned drafting and detailed proving, and formulates solution construction as autoregressive generation with per-step feedback. Second, we propose CoPA (Chain-of-Proxy-Autoformalization), a data generation pipeline that cascades statement autoformalization, proof drafting, and proof search as a proxy autoformalization path.
Experiments demonstrate significant improvements: trained on data bootstrapped by CoPA, HAR achieves superior performance on FormalMath500 () and MiniF2F-Solving () with lower computational budget. Explorations reveal promising directions in formal solution pruning and informal dataset denoising.