Researcher, Google
2 papers at NeurIPS 2025
Risk management processes as a way of improving, assessing, and comparing benchmark reliability result in a benchmark of benchmarks
A Novel Dataset with Demographically Intersectional Visual Evaluations (DIVE) for Pluralistic Alignment of Text-to-Image models