Postdoc, University of Science and Technology of China
2 papers at NeurIPS 2025
A new benchmark to evaluate visual caption for MLLMs with considering both correctness and thoroughness.