Efficient Transformers

2 papers across 2 sessions

Poster Session 4

1 paper

Thursday, December 4, 2025 · 4:30 PM → 7:30 PM

Exhibit Hall C,D,E

Fourier Token Merging: Understanding and Capitalizing Frequency Domain for Efficient Image Generation

#3604 · Jiesong Liu, Xipeng Shen

Poster Session 5

1 paper

Friday, December 5, 2025 · 11:00 AM → 2:00 PM

Exhibit Hall C,D,E

Polar Sparsity: High Throughput Batched LLM Inferencing with Scalable Contextual Sparsity

#3513 · Susav Shrestha, Bradley Settlemyer, Nikoli Dryden, Narasimha Reddy

Polar Sparsity scales contextual sparsity to large batches by exploiting stable attention-head sparsity and efficient GPU kernels, achieving speedups of up to 2.2× with minimal accuracy loss.