2 papers across 2 sessions
We introduce a principled framework for validating LLM-as-a-judge systems under rating indeterminacy, where multiple ratings can be "correct."
We argue that conclusions drawn about relative system safety or attack method efficacy via AI red teaming are often not supported by the evidence that attack success rate (ASR) comparisons provide.