3 papers across 2 sessions
To address the inefficiency caused by excessive visual tokens in LVLMs, we take an information-flow perspective that reveals how visual redundancy emerges dynamically during inference, and introduce a pruning method aligned with the model's inherent behavior that outperforms existing approaches.
A novel visual token pruning method that jointly maximizes the saliency and coverage of the selected visual tokens, better preserving semantic completeness.
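To make the objective concrete, here is a minimal greedy-selection sketch; the attention-derived saliency scores, the cosine-similarity coverage term, and the `alpha` trade-off weight are all illustrative assumptions, not the paper's actual formulation.

```python
import torch
import torch.nn.functional as F

def select_salient_and_covering(features, saliency, k, alpha=0.5):
    """Greedily pick k tokens, trading off saliency against coverage.

    features: (N, d) visual token embeddings
    saliency: (N,)  per-token importance (e.g., attention received)
    alpha:    assumed trade-off weight between the two terms
    """
    feats = F.normalize(features, dim=-1)
    sim = feats @ feats.T                    # pairwise cosine similarity
    n = features.size(0)
    covered = torch.zeros(n)                 # how well each token is covered so far
    selected, remaining = [], set(range(n))
    for _ in range(k):
        best, best_gain = None, -float("inf")
        for j in remaining:
            # Marginal coverage gain of adding token j (facility-location style).
            cov_gain = torch.clamp(sim[j] - covered, min=0).sum()
            gain = (alpha * saliency[j] + (1 - alpha) * cov_gain).item()
            if gain > best_gain:
                best, best_gain = j, gain
        selected.append(best)
        remaining.remove(best)
        covered = torch.maximum(covered, sim[best])
    return torch.tensor(selected)
```

Usage would look like `keep = select_salient_and_covering(vis_tokens, attn_scores, k=64)` to retain 64 tokens (names hypothetical); the coverage term keeps the selection from collapsing onto a few high-saliency regions.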
We propose CDPruner, a training-free visual token pruning method that accelerates MLLM inference by maximizing the conditional diversity of the retained tokens.
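Maximizing conditional diversity is commonly cast as greedy MAP inference over a conditional determinantal point process (DPP); the sketch below assumes a quality-diversity kernel built from hypothetical instruction-relevance scores and uses the fast greedy update of Chen et al. (2018), so it illustrates the idea rather than CDPruner's exact implementation.

```python
import torch
import torch.nn.functional as F

def conditional_diversity_select(features, relevance, k):
    """Greedy MAP inference on a conditional DPP kernel (illustrative).

    features:  (N, d) visual token embeddings
    relevance: (N,)  assumed relevance of each token to the instruction,
               conditioning the diversity objective on the prompt
    """
    feats = F.normalize(features, dim=-1)
    # Quality-diversity decomposition: L = diag(q) S diag(q)
    L = relevance[:, None] * (feats @ feats.T) * relevance[None, :]
    n = L.size(0)
    c = torch.zeros(n, 0)                    # incremental Cholesky-style factors
    d2 = L.diagonal().clone()                # marginal log-det gain per token
    selected = []
    for _ in range(k):
        j = int(torch.argmax(d2))
        selected.append(j)
        # Rank-one update of the remaining gains (fast greedy MAP).
        e = (L[:, j] - c @ c[j]) / d2[j].clamp_min(1e-12).sqrt()
        c = torch.cat([c, e[:, None]], dim=1)
        d2 = d2 - e.pow(2)
        d2[selected] = -float("inf")         # never re-pick a kept token
    return torch.tensor(selected)
```

Because each candidate's log-det gain shrinks once similar tokens are kept, the selection spreads across distinct, instruction-relevant image regions instead of clustering on the most attended patch.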