Postdoc, ETH Zurich (ETHZ)
2 papers at NeurIPS 2025
We uncover the emergent open-vocabulary semantic segmentation capability of diffusion transformers and show that amplifying this property enhances both segmentation and image generation.