Intern, University of Illinois at Urbana-Champaign
Two papers accepted at NeurIPS 2025
We propose two techniques to improve the data efficiency of LLM RL fine-tuning: difficulty-targeted online data selection and rollout replay.
We propose a Training-Free Bayesianization approach for LLM adapters that achieves improved uncertainty estimation.