Researcher, Docta.ai
1 paper at NeurIPS 2025
A peer-prediction-based automatic evaluator for scoring human values in crowdsourcing datasets contaminated by LLM-generated responses.