PhD student, Stanford University
1 paper at NeurIPS 2025
We propose Reference-free Preference Steering (RePS), a bidirectional preference-optimization objective that jointly does concept steering and suppression.