PhD student, Institute of Science Tokyo
One paper at NeurIPS 2025:
A novel architecture that extends mixture-of-experts (MoE) beyond the feed-forward layers to the attention layers as well, using a unified expert design and attention-FFN parameter sharing.
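A minimal sketch of the general idea, not the paper's implementation: a single pool of experts is routed to from both the attention sublayer and the FFN sublayer, so the two share expert parameters. All names (`SharedExpertPool`, `MoEBlock`, `top_k`, the routers) are hypothetical placeholders for illustration.

```python
import torch
import torch.nn as nn


class SharedExpertPool(nn.Module):
    """A pool of small FFN experts reused by both the attention and FFN sublayers."""

    def __init__(self, d_model: int, d_hidden: int, n_experts: int, top_k: int = 2):
        super().__init__()
        self.top_k = top_k
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_hidden), nn.GELU(), nn.Linear(d_hidden, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x: torch.Tensor, router_logits: torch.Tensor) -> torch.Tensor:
        # x: (tokens, d_model); router_logits: (tokens, n_experts)
        weights, idx = router_logits.softmax(-1).topk(self.top_k, dim=-1)
        out = torch.zeros_like(x)
        for slot in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = idx[:, slot] == e
                if mask.any():
                    out[mask] += weights[mask, slot].unsqueeze(-1) * expert(x[mask])
        return out


class MoEBlock(nn.Module):
    """Transformer block where attention output refinement and the FFN use the same experts."""

    def __init__(self, d_model: int, n_heads: int, d_hidden: int, n_experts: int):
        super().__init__()
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.experts = SharedExpertPool(d_model, d_hidden, n_experts)
        self.attn_router = nn.Linear(d_model, n_experts)  # routes attention outputs
        self.ffn_router = nn.Linear(d_model, n_experts)   # routes FFN inputs
        self.norm1 = nn.LayerNorm(d_model)
        self.norm2 = nn.LayerNorm(d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Attention sublayer: mix tokens, then pass each token through routed shared experts.
        h = self.norm1(x)
        mixed, _ = self.attn(h, h, h)
        flat = mixed.reshape(-1, mixed.size(-1))
        x = x + self.experts(flat, self.attn_router(flat)).view_as(mixed)

        # FFN sublayer: the same expert pool serves as the feed-forward network.
        h = self.norm2(x).reshape(-1, x.size(-1))
        x = x + self.experts(h, self.ffn_router(h)).view_as(x)
        return x


if __name__ == "__main__":
    block = MoEBlock(d_model=64, n_heads=4, d_hidden=128, n_experts=4)
    tokens = torch.randn(2, 10, 64)   # (batch, seq, d_model)
    print(block(tokens).shape)        # torch.Size([2, 10, 64])
```

The sketch only illustrates the structural point of sharing one expert pool across both sublayers; the paper's actual routing, expert granularity, and attention integration may differ.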