PhD student, Chinese Academy of Sciences
2 papers at NeurIPS 2025
We present SolidGeo, the first benchmark focused on solid geometry. It reveals the poor performance of current multimodal large language models (MLLMs) on solid geometry problems and analyzes their inference flaws.
We propose a general reinforcement learning framework tailored to interleaved multimodal tasks. It permutes image sequences to simulate varied positional relationships, so that training explores greater spatial and positional diversity.
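
As a rough illustration of the permutation idea only, and not the paper's implementation, the sketch below generates several distinct orderings of an image sequence and records the index mapping for each, so that any text referring to "image 1", "image 2", and so on could be remapped. The function name `permute_image_sequence` and its parameters are hypothetical.

```python
import math
import random
from typing import List, Sequence, Tuple


def permute_image_sequence(
    images: Sequence[str], num_variants: int, seed: int = 0
) -> List[Tuple[List[int], List[str]]]:
    """Return distinct reorderings of `images` (hypothetical helper).

    Each variant is (order, reordered_images), where order[k] is the
    original index of the image now placed at position k.
    """
    rng = random.Random(seed)
    indices = list(range(len(images)))
    # There are only n! distinct orderings, so cap the request.
    target = min(num_variants, math.factorial(len(images)))
    seen = set()
    variants: List[Tuple[List[int], List[str]]] = []
    while len(variants) < target:
        order = indices[:]
        rng.shuffle(order)
        key = tuple(order)
        if key in seen:
            continue  # skip orderings that were already produced
        seen.add(key)
        variants.append((order, [images[i] for i in order]))
    return variants
```

Each returned variant could then be paired with the same question, giving the policy the same content under different positional arrangements.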