4 papers across 3 sessions
We introduce ORIGAMISPACE, a new origami dataset and benchmark to evaluate MLLMs in Multi-Step Spatial Reasoning with Mathematical Constraints.
Loong-X enables hands-free image editing using multimodal neural signals, achieving performance comparable to text-driven methods by combining BCIs with the proposed diffusion-based generative methods.