Lecturer, Ho Chi Minh City University of Economics
2 papers at NeurIPS 2025
We analyze the flow of tokens across attention layers and use these insights to enhance performance of Transformers.
We propose and analyze a new class of unbalanced weak optimal transport (OT) problems with total variation penalties.