2 papers across 1 session
We propose a superior MoE pruning framework that determines the importance of experts in MoE models through a theoretical perspective.