Researcher, The Research Institution of China Mobile
1 paper at NeurIPS 2025
A novel two-stage Direct Preference Optimization framework that enables short-context vision-language models to robustly understand ultra-long videos without any long-video annotations.