Postdoc, ETHZ - ETH Zurich
2 papers at NeurIPS 2025
We enhance diffusion models to be able to recall content from long back in a sequence in order to produce consistent content
We propose LangHOPS, the first Multimodal Large Language Model (MLLM)-based framework for open-vocabulary object–part instance segmentation.