MS student, EPFL - EPF Lausanne
1 paper at NeurIPS 2025
Interpretability methods that assume linear, orthogonal features fall short for modern neural representations, which are often hierarchical and nonlinear. Methods aligned with the actual structure of these representations give better results.