PhD student, Department of Electronic Engineering, Tsinghua University
1 paper at NeurIPS 2025
In this paper, we present Spatial-MLLM, a novel framework for spatial understanding and reasoning with only 2D inputs.