MS student, University of Oxford
1 paper at NeurIPS 2025
Using a heterogeneous Mixture-of-Experts model architecture, we show that brain-like processing pathways form due to inductive biases on processing complexity and expert dropout