Full Professor, University of California, Berkeley
2 papers at NeurIPS 2025
We introduce a method that allows adversarial optimization to be used in general-sum settings to train more robust and diverse policies.
We propose SimpleStrat for diversifying LLM generations and introduce CoverageQA a benchmark of multi-answer questions for evaluating coverage.