Assistant Professor, The Chinese University of Hong Kong, Shenzhen
2 papers at NeurIPS 2025
We introduce a RL framework to train LLM's reasoning and self-verification ability simultaneously.
We created a dataset to evaluate current models' ability to actively detect and alert risks based on the observations of user behaviors.