2 papers across 2 sessions
We propose a multi-objective Bayesian Optimization algorithm that shows state-of-the-art performance on a wide set of benchmark problems.
EVOREFUSE: Evolutionary Prompt Optimization for Evaluation and Mitigation of LLM Over-Refusal to Pseudo-Malicious Instructions