Assistant Professor, University of California, Santa Cruz
4 papers at NeurIPS 2025
A peer prediction-based automatic evaluator for scoring human values in crowdsourcing datasets contaminated by LLM-generated responses.
We introduce an RL framework that unifies the training of answer generation and verification in a single model.
Learning individualized treatment effects under the assumption of rank preservation.