NLP expert, Kuaishou Technology
2 papers at NeurIPS 2025
A kv cache compression method for large vision-language models