PhD student, University of Maryland, College Park
1 paper at NeurIPS 2025
We propose a text-aligned visual representation to unify both visual understanding and generation within a single MLLM.