PhD student, the University of Hong Kong
6 papers at NeurIPS 2025
A data construction pipeline and a DiT framework for subject-driven video customization under multimodal control conditions
We propose Seg-VAR, a novel framework that rethinks segmentation as a conditional autoregressive mask generation problem.