Undergrad student, Peking University
2 papers at NeurIPS 2025
Safe RLHF-V, the multimodal safety alignment framework.
A human preference dataset for multi-turn interleaved multimodal understanding and generatin tasks