today local_bar

Yuxin Zhang

PhD student, Xiamen University

2 papers at NeurIPS 2025

OpenReview· Semantic Scholar· Google Scholar

Poster Session 5

Friday, December 5, 2025 · 11:00 AM → 2:00 PM

Exhibit Hall C,D,E

Spotlight Attention: Towards Efficient LLM Generation via Non-linear Hashing-based KV Cache Retrieval

#5514 · Wenhao Li, Yuxin Zhang, Gen Luo, Haiyuan Wan, ZiYang Gong, Fei Chao, Rongrong Ji

KV cache retrieval for large language models using nonlinear hashing function.

Discovering Important Experts for Mixture-of-Experts Models Pruning Through a Theoretical Perspective

#3600 · Weizhong Huang, Yuxin Zhang, Xiawu Zheng, Fei Chao, Rongrong Ji, Liujuan Cao

We propose a superior MoE pruning framework that determines the importance of experts in MoE models through a theoretical perspective.