PhD student, Department of Computer Science, University of Washington
1 paper at NeurIPS 2025
A framework and benchmark to evaluate language models' reasoning on imperfect tabular data