MS student, Korea Advanced Institute of Science & Technology
1 paper at NeurIPS 2025
We uncover the emergent open-vocabulary semantic segmentation capability of diffusion transformers and show that amplifying this property enhances both segmentation and image generation.