PhD student, The Hong Kong University of Science and Technology
2 papers at NeurIPS 2025
A robust watermarking scheme for intellectual property protection in diffusion models.
We propose a lifelong safety alignment framework where a Meta-Attacker and Defender co-evolve to uncover and defend against unseen jailbreaking strategies.