Tenure-Track Faculty, CISPA Helmholtz Center for Information Security
1 paper at NeurIPS 2025
GASP is a novel black-box attack framework that efficiently explores the latent space to generate human-readable adversarial suffixes, significantly improving jailbreak success rates while maintaining prompt coherence.