Code Execution

2 papers across 2 sessions

Poster Session 1

Wednesday, December 3, 2025 · 11:00 AM → 2:00 PM

Agentic RL Scaling Law: Spontaneous Code Execution for Mathematical Problem Solving

#306 · Xinji Mai, Haotian Xu, Xing W, Weinong Wang, Yingying Zhang, Wenqiang Zhang

This paper presents ZeroTIR, revealing agent‑level RL scaling laws that tie training steps, code‑call frequency, response length, and accuracy, and surpassing ZeroRL and SFT baselines on challenging math benchmarks.

Poster Session 4

1 paper

Thursday, December 4, 2025 · 4:30 PM → 7:30 PM

Exhibit Hall C,D,E

CodeCrash: Exposing LLM Fragility to Misleading Natural Language in Code Reasoning

#2610 · Man Ho Lam, Chaozheng Wang, Jen-Tse Huang, Michael R Lyu