Open-ended Response Evaluation

1 paper across 1 session

Poster Session 1

Wednesday, December 3, 2025 · 11:00 AM → 2:00 PM

MM-OPERA: Benchmarking Open-ended Association Reasoning for Large Vision-Language Models

#4612 · Zimeng Huang, Jinxin Ke, Xiaoxuan Fan, Yufeng Yang, Yang Liu, Liu Zhonghan, Zedi Wang, Junteng Dai, Haoyi Jiang, Yuyu Zhou, Keze Wang, Ziliang Chen

MM-OPERA is a benchmark of 11,497 open-ended association tasks of two types, Remote-Item and In-Context. It combines explicit multi-hop reasoning with LLM-as-a-Judge process-reward evaluation to assess the convergent and divergent associative thinking of large vision-language models (LVLMs).