underspecification - NeurIPS 2025

today local_bar

underspecification

2 papers across 2 sessions

Poster Session 4

Thursday, December 4, 2025 · 4:30 PM → 7:30 PM

Exhibit Hall C,D,E

QuestBench: Can LLMs ask the right question to acquire information in reasoning tasks?

#809 · Belinda Li, Been Kim, Zi Wang

Poster Session 5

Friday, December 5, 2025 · 11:00 AM → 2:00 PM

Exhibit Hall C,D,E

AbstentionBench: Reasoning LLMs Fail on Unanswerable Questions

#1402 · Polina Kirichenko, Mark Ibrahim, Kamalika Chaudhuri, Samuel J. Bell

Our new benchmark AbstentionBench reveals reasoning models struggle to determine when not to answer.