Full Professor, Nanyang Technological University
2 papers at NeurIPS 2025
Exploiting the tendency of LLMs to overfit, we fine-tune them on only ten benign QA pairs, which is enough to jailbreak them.