Undergrad student, Tsinghua University
1 paper at NeurIPS 2025
We introduce DrVD-Bench, the first benchmark to evaluate vision-language models' clinical reasoning ability across graded medical imaging tasks.