PhD student, The University of Hong Kong
1 paper at NeurIPS 2025
a unified multimodal model purely based on discrete flow matching, achieving comparable performance with AR-based MLLMs