The Cyprus Institute - NeurIPS 2025

🏛 The Cyprus Institute

1 paper across 1 session

Poster Session 5

Friday, December 5, 2025 · 11:00 AM → 2:00 PM

Towards Interpretability Without Sacrifice: Faithful Dense Layer Decomposition with Mixture of Decoders

#1110 · James Oldfield, Shawn Im, Sharon Li, Mihalis Nicolaou, Ioannis Patras, Grigorios Chrysos

Mixtures of Decoders (MxDs) show that dense layers are represented more faithfully by mixtures of specialized sublayers than by sparsely activating neurons, while remaining just as interpretable.