Intern, ELLIS Institute Tübingen
1 paper at NeurIPS 2025
We show where existing dropout-based methods fall short on long-range tasks.