2 papers across 2 sessions
We introduce the first theoretical framework for analyzing LLM reasoning errors and bridge two typical sampling-based test-time scaling methods to achieve both low error and fast convergence.