2 papers across 1 session
We investigate Linear Mode Connectivity (LMC) in Mixture-of-Experts (MoE) architectures by analyzing their underlying permutation symmetries and proposing expert-matching algorithms that align independently trained MoEs to reveal LMC.
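To illustrate the expert-matching idea (a sketch, not the paper's exact algorithm), the example below pairs the experts of two independently trained MoE layers by solving a linear assignment problem over flattened expert weights, then interpolates along the connecting path. The array names, the squared-L2 cost, and the toy data are assumptions made for this illustration.

```python
# Minimal sketch: permutation matching of MoE experts via linear assignment,
# followed by linear interpolation between the aligned layers.
# All names and the cost function are illustrative assumptions.
import numpy as np
from scipy.optimize import linear_sum_assignment

def match_experts(experts_a: np.ndarray, experts_b: np.ndarray) -> np.ndarray:
    """Return a permutation of model B's experts that best aligns them to model A.

    experts_a, experts_b: arrays of shape (num_experts, num_params),
    one flattened weight vector per expert.
    """
    # Cost of assigning expert j of B to slot i of A: squared L2 distance.
    cost = ((experts_a[:, None, :] - experts_b[None, :, :]) ** 2).sum(-1)
    _, col_ind = linear_sum_assignment(cost)
    return col_ind  # col_ind[i] is the B-expert matched to A-expert i

def interpolate(experts_a, experts_b, perm, alpha=0.5):
    """A point on the linear path between A and the permuted B."""
    return (1 - alpha) * experts_a + alpha * experts_b[perm]

# Toy usage: B is a shuffled, lightly perturbed copy of A, so matching
# should recover the shuffle and the midpoint should stay close to A.
rng = np.random.default_rng(0)
A = rng.normal(size=(8, 16))
B = A[rng.permutation(8)] + 0.01 * rng.normal(size=(8, 16))
perm = match_experts(A, B)
midpoint = interpolate(A, B, perm)
```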
We propose a unified framework for model merging that leverages multiple symmetry classes to enable low- and zero-loss interpolation between independently trained Transformer models, including Vision Transformers and GPT-2.
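As a sketch of the final merging step, assuming the two checkpoints have already been aligned under the relevant symmetries and share identical parameter names, the snippet below interpolates matching tensors in weight space; the function name and toy state dicts are illustrative, not the framework's API.

```python
# Hypothetical illustration: linear interpolation between two aligned
# checkpoints, the path along which loss barriers are measured.
import torch

def interpolate_state_dicts(sd_a: dict, sd_b: dict, alpha: float = 0.5) -> dict:
    """Return the weight-space point (1 - alpha) * A + alpha * B."""
    return {k: (1 - alpha) * sd_a[k] + alpha * sd_b[k] for k in sd_a}

# Toy usage with two small "checkpoints" sharing the same parameter names.
sd_a = {"layer.weight": torch.randn(4, 4), "layer.bias": torch.zeros(4)}
sd_b = {"layer.weight": torch.randn(4, 4), "layer.bias": torch.ones(4)}
midpoint = interpolate_state_dicts(sd_a, sd_b, alpha=0.5)
# Sweeping alpha from 0 to 1 and evaluating the loss at each point traces the
# linear path; a low barrier along this path is the low-loss interpolation
# the framework targets after alignment.
```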