We introduce DrVD-Bench, the first benchmark to evaluate vision-language models' clinical reasoning ability across graded medical imaging tasks.