?
today
local_bar
search
Benchmark Datasets
2 papers across 2 sessions
Poster Session 1
1 paper
Wednesday, December 3, 2025 · 11:00 AM → 2:00 PM
Exhibit Hall C,D,E
CodeAssistBench (CAB): Dataset & Benchmarking for Multi-turn Chat-Based Code Assistance
star
#1617
·
Myeongsoo Kim, Shweta Garg, Baishakhi Ray, Varun Kumar, Anoop Deoras
Poster Session 5
1 paper
Friday, December 5, 2025 · 11:00 AM → 2:00 PM
Exhibit Hall C,D,E
Mars-Bench: A Benchmark for Evaluating Foundation Models for Mars Science Tasks
star
#5002
·
Mirali Purohit, Bimal Gajera, Vatsal Malaviya, Irish Mehta, Kunal Kasodekar, Jacob Adler, Steven Lu, Umaa Rebbapragada, Hannah Kerner