Full Professor, Tsinghua University, Tsinghua University
2 papers at NeurIPS 2025
The proposed brain-inspired CHT Soft Rule with Sigmoid Decay Density (CHTss) achieves comparable even better performance compared to fully connected models across various tasks, enabling high sparsity in Transformers and LLMs.