today local_bar

Hosung Song

Researcher, LG Corporation

1 paper at NeurIPS 2025

OpenReview· Semantic Scholar· Google Scholar

Poster Session 2

Wednesday, December 3, 2025 · 4:30 PM → 7:30 PM

Exhibit Hall C,D,E

KL Penalty Control via Perturbation for Direct Preference Optimization

#404 · Sangkyu Lee, Janghoon Han, Hosung Song, Stanley Jungkyu Choi, Honglak Lee, Youngjae Yu

Instance-level adaptive KL penalty control method for Direct Preference Optimization