MM-OPERA is a benchmark of 11,497 open-ended association tasks, in two types (Remote-Item and In-Context), for assessing the convergent and divergent associative thinking of large vision-language models (LVLMs). The tasks involve explicit multi-hop reasoning, and responses are scored with LLM-as-a-Judge process-reward evaluation.