1 paper across 1 session
In this paper, we present Spatial-MLLM, a novel framework for spatial understanding and reasoning with only 2D inputs.