Intern, Institute for Computer Science, Artificial Intelligence and Technology
1 paper at NeurIPS 2025
We combine discrete and continuous adversarial attacks during adversarial training to produce more robust LLMs. Evaluated across realistic inference settings, our models are more robust than prior state-of-the-art models while matching their training cost.