today local_bar

Tuo Zhao

Associate Professor, Georgia Institute of Technology

4 papers at NeurIPS 2025

Homepage· OpenReview· Semantic Scholar· Google Scholar

Poster Session 2

Wednesday, December 3, 2025 · 4:30 PM → 7:30 PM

Exhibit Hall C,D,E

AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders

#4104 Spotlight · Yuezhou Hu, Jiaxin Guo, Xinyu Feng, Tuo Zhao

Ask a Strong LLM Judge when Your Reward Model is Uncertain

#3719 · Zhenghao Xu, Qin Lu, Qingru Zhang, Liang Qiu, Ilgee Hong, Changlong Yu, Wenlin Yao, Yao Liu, Haoming Jiang, Lihong Li, Hyokun Yun, Tuo Zhao

We propose an uncertainty-based routing framework that efficiently complements a fast RM with a strong but costly LLM judge.

Poster Session 5

Friday, December 5, 2025 · 11:00 AM → 2:00 PM

Exhibit Hall C,D,E

A Minimalist Example of Edge-of-Stability and Progressive Sharpening

#4004 · Liming Liu, Zixuan Zhang, Simon Shaolei Du, Tuo Zhao

A new minimalist example to understand the Edge of Stability and Progressive Sharpening phenomenon

Poster Session 6

Friday, December 5, 2025 · 4:30 PM → 7:30 PM

Exhibit Hall C,D,E

Think-RM: Enabling Long-Horizon Reasoning in Generative Reward Models

#3613 · Ilgee Hong, Changlong Yu, Liang Qiu, Weixiang Yan, Zhenghao Xu, Haoming Jiang, Qingru Zhang, Qin Lu, Xin Liu, Chao Zhang, Tuo Zhao

We propose Think-RM, a training framework for generative reward models that enables long-horizon reasoning, and introduce a pairwise RLHF pipeline that directly optimizes policies using pairwise preference rewards.