MS student, University of Science and Technology of China
1 paper at NeurIPS 2025
A new benchmark for evaluating visual captioning by MLLMs that considers both correctness and thoroughness.