?
today
local_bar
search
model evaluation
2 papers across 2 sessions
Poster Session 2
1 paper
Wednesday, December 3, 2025 · 4:30 PM → 7:30 PM
Exhibit Hall C,D,E
TCM-Ladder: A Benchmark for Multimodal Question Answering on Traditional Chinese Medicine
star
#1711
·
Jiacheng Xie, Yang Yu, Ziyang Zhang, Shuai Zeng, Jiaxuan He, Ayush Vasireddy, Xiaoting tang, Congyu Guo, Lening Zhao, Congcong Jing, Guanghui An, Dong Xu
Poster Session 5
1 paper
Friday, December 5, 2025 · 11:00 AM → 2:00 PM
Exhibit Hall C,D,E
Scaling Up Active Testing to Large Language Models
star
#110
·
Gabrielle Berrada, Jannik Kossen, Freddie Bickford Smith, Muhammed Razzak, Yarin Gal, Thomas Rainforth