6 papers across 3 sessions
We study the stochastic shortest path problem with sparse adversarial costs and, under known transitions, characterize the minimax regret achieved by online mirror descent (OMD) with a novel $\ell_r$-norm regularizer for $r \in [1,2]$.
We propose a global pruning framework that efficiently learns unstructured sparsity for LLMs.
FPSAttention is a training-aware FP8 quantization and sparsity co-design for video diffusion models that achieves up to 4.96× speedup without quality loss through 3D tile-granularity alignment, denoising-step adaptation, and hardware-efficient kernels.
We investigate new scaling laws that predict LLM performance when training over quantized or sparse representations.
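As a hypothetical illustration of the kind of fit such scaling laws involve, the sketch below recovers a power-law exponent from synthetic loss measurements. The functional form $L(N) = a N^{-b}$, the coefficient values, and the variable names are assumptions for demonstration only, not the paper's actual model:

```python
import numpy as np

# Hypothetical illustration: fit a power-law scaling curve L(N) = a * N**(-b)
# to synthetic loss measurements via log-log linear regression.
# All values here are assumed for demonstration, not from the paper.

rng = np.random.default_rng(0)
N = np.array([1e7, 3e7, 1e8, 3e8, 1e9])   # model sizes (parameters)
true_a, true_b = 50.0, 0.3                # assumed power-law coefficients
loss = true_a * N ** (-true_b) * np.exp(rng.normal(0, 0.01, N.size))

# In log space the power law is linear: log L = log a - b * log N
slope, intercept = np.polyfit(np.log(N), np.log(loss), 1)
b_hat, a_hat = -slope, np.exp(intercept)
print(f"fitted exponent b = {b_hat:.3f}, prefactor a = {a_hat:.1f}")
```

With quantized or sparse training, a scaling law of this shape would typically be refit per representation to see how the exponent and prefactor shift.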