Researcher, Skywork AI
2 papers at NeurIPS 2025
We introduce an RL framework that unify the training of answer generation and verification in a single model.