Researcher, Polynome AI
1 paper at NeurIPS 2025
Training-free depth pruning method for transformer-based models by approximating blocks with linear transformations