Assistant Professor, University of Waterloo
5 papers at NeurIPS 2025
We introduce MoCha, the first model for dialogue-driven movie shot generation.
We introduce a novel reasoning paradigm: pixel-space reasoning. We identify a learning trap that arises when cultivating this ability and propose a curiosity-driven RL approach to address it.