visual understanding

2 papers across 2 sessions

Poster Session 1

Wednesday, December 3, 2025 · 11:00 AM → 2:00 PM

VLMs have Tunnel Vision: Evaluating Nonlocal Visual Reasoning in Leading VLMs

#4911 Spotlight · Shmuel Berman, Jia Deng

Our structured dataset allows us to analyze how model vision compares to human perception and to determine whether VLMs can perform the same kinds of visual reasoning algorithms that humans can.

Poster Session 4

1 paper

Thursday, December 4, 2025 · 4:30 PM → 7:30 PM

Exhibit Hall C, D, E

EA3D: Online Open-World 3D Object Extraction from Streaming Videos

#4910 · Xiaoyu Zhou, Jingqi Wang, Yuang Jia, Yongtao Wang, Deqing Sun, Ming-Hsuan Yang

A unified online framework for open-world 3D object extraction.