Yingying Zhang

Assistant Professor, East China Normal University

2 papers at NeurIPS 2025

Homepage· OpenReview· Semantic Scholar· Google Scholar

Poster Session 1

2 papers

Wednesday, December 3, 2025 · 11:00 AM → 2:00 PM

Exhibit Hall C,D,E

Agentic RL Scaling Law: Spontaneous Code Execution for Mathematical Problem Solving

#306 · Xinji Mai, Haotian Xu, Xing W, Weinong Wang, Yingying Zhang, Wenqiang Zhang

This paper presents ZeroTIR, revealing agent‑level RL scaling laws that tie training steps, code‑call frequency, response length, and accuracy, and surpassing ZeroRL and SFT baselines on challenging math benchmarks.

Conformal Prediction Beyond the Horizon: Distribution-Free Inference for Policy Evaluation

#416 · Feichen Gan, Lu Youcun, Yingying Zhang, Yukun Liu

We propose a unified conformal prediction framework for infinite-horizon policy evaluation that seamlessly accommodates both on-policy and off-policy scenarios.