PhD student, The University of Tokyo, Tokyo Institute of Technology
1 paper at NeurIPS 2025
We developed an iterative, weakly supervised pipeline that refines LLM-generated pseudo-labels. It consistently outperforms both the original LLMs and existing self-refinement methods across diverse datasets, and it effectively supports LLM safety alignment.