MS student, Jinan University
1 paper at NeurIPS 2025
MM-OPERA is a benchmark of 11,497 open-ended association tasks (Remote-Item and In-Context) with explicit multi-hop reasoning and LLM-as-a-Judge process-reward evaluation to assess LVLMs' convergent and divergent associative thinking.