Yujin Tang

Researcher, Sakana AI

1 paper at NeurIPS 2025

OpenReview· Semantic Scholar· Google Scholar

Poster Session 4

1 paper

Thursday, December 4, 2025 · 4:30 PM → 7:30 PM

Exhibit Hall C,D,E

Reinforcement Learning Teachers of Test Time Scaling

#3519 · Edoardo Cetin, Tianyu Zhao, Yujin Tang

We introduce a new class of Reinforcement Learned Teachers trained to provide effective reasoning traces for downstream distillation, yielding more effective data for distillation and cold-starting than orders of magnitude larger reasoning LMs.