PhD student, Pennsylvania State University
1 paper at NeurIPS 2025
We show that visual input induces an activation shift that weakens the safety of vision-language models (VLMs), and we propose a calibration method to mitigate this effect.