Video Large Language Models; Long Video Understanding; Preference Optimization

1 paper across 1 session

Poster Session 4

Thursday, December 4, 2025 · 4:30 PM → 7:30 PM

LongVPO: From Anchored Cues to Self-Reasoning for Long-Form Video Preference Optimization

#4713 · Zhenpeng Huang, Jiaqi Li, zihan jia, Xinhao Li, Desen Meng, Lingxue Song, Xi Chen, Liang Li, Limin Wang

A novel two‑stage Direct Preference Optimization framework that enables short‑context vision‑language models to robustly understand ultra‑long videos without any long‑video annotations.