Researcher, Alibaba Group
1 paper at NeurIPS 2025
A new benchmark for evaluating visual captioning in MLLMs that considers both correctness and thoroughness.