program semantics

1 paper across 1 session

Poster Session 3

Thursday, December 4, 2025 · 11:00 AM → 2:00 PM

Evaluating Program Semantics Reasoning with Type Inference in System

F

#1513 · Yifeng He, Luning Yang, Christopher Gonzalo, Hao Chen

We present a novel pair of benchmarks to evaluate the fundamental deductive reasoning abilities of test-time compute reasoning models on program semantics.