Undergraduate student, Birla Institute of Technology and Science, K K Birla Goa Campus
1 paper at NeurIPS 2025
GASP is a novel black-box attack framework that efficiently explores the latent space to generate human-readable adversarial suffixes, significantly improving jailbreak success rates while maintaining prompt coherence.