?
today
local_bar
search
Visual-Language Model
3 papers across 2 sessions
Poster Session 4
2 papers
Thursday, December 4, 2025 · 4:30 PM → 7:30 PM
Exhibit Hall C,D,E
Actial: Activate Spatial Reasoning Ability of Multimodal Large Language Models
star
#4701
·
Xiaoyu Zhan, Wenxuan Huang, Hao Sun, Xinyu Fu, Changfeng Ma, Shaosheng Cao, Bohan Jia, Shaohui Lin, Zhenfei Yin, LEI BAI, Wanli Ouyang, Yuanqi Li, Jie Guo, Yanwen Guo
SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning
star
#5302
·
Yang Liu, Ming Ma, Xiaomin Yu, Pengxiang Ding, Han Zhao, Mingyang Sun, Siteng Huang, Donglin Wang
Poster Session 6
1 paper
Friday, December 5, 2025 · 4:30 PM → 7:30 PM
Exhibit Hall C,D,E
The Narrow Gate: Localized Image-Text Communication in Native Multimodal Models
star
#1114
·
Alessandro Serra, Francesco Ortu, Emanuele Panizon, Lucrezia Valeriani, Lorenzo Basile, Alessio Ansuini, Diego Doimo, Alberto Cazzaniga