PhD student, Johannes Kepler Universität Linz
2 papers at NeurIPS 2025
We introduce TiledFlashLinearAttention a faster kernel algorithm for Linear RNNs and mLSTMs by improved Sequence Parallelism.
Enhancing linear RNNs to multi-dimensional structures, stable and parallelizable.