1 paper across 1 session
Automatically detecting task-specific important tokens to accelerate speculative decoding