5 papers across 3 sessions
We prove a purely statistical separation between Transformers and other architectures, such as feedforward and recurrent networks: Transformers are more sample-efficient at learning sparse sequence models.
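A minimal sketch of what a sparse sequence model can look like, assuming a hypothetical q-sparse formulation in which each output depends on only q of the T input positions; the paper's exact model class and separation argument are not reproduced here.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical q-sparse sequence model: the output depends on only
# q of the T input positions (the paper's exact class may differ).
T, q, vocab = 16, 2, 8
support = rng.choice(T, size=q, replace=False)   # hidden sparse support
table = rng.integers(vocab, size=(vocab,) * q)   # arbitrary q-ary rule

def label(x):
    """Output depends only on the q positions in `support`."""
    return table[tuple(x[support])]

# A learner must identify the support from samples; intuitively,
# attention can select the q relevant positions directly, while a
# fixed feedforward readout must account for all T positions.
X = rng.integers(vocab, size=(5, T))
print(support, [label(x) for x in X])
```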
We propose LION, a framework for extending Linear Transformers to the bidirectional setting by providing three theoretically equivalent representations: full attention, bidirectional RNN, and chunkwise parallel form.
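A minimal numerical sketch of the attention/RNN equivalence that such bidirectional linear attention rests on, assuming unnormalized linear attention with an identity feature map (LION's exact parameterization, gating, and normalization may differ): non-causal linear attention equals a forward plus a backward causal scan, with the doubly counted diagonal term removed.

```python
import numpy as np

rng = np.random.default_rng(1)
T, d = 10, 4
Q, K, V = (rng.standard_normal((T, d)) for _ in range(3))

# Full-attention form: O = (Q K^T) V, no causal mask, no softmax.
O_attn = (Q @ K.T) @ V

# Bidirectional-RNN form: forward state S_f(t) = sum_{s<=t} k_s v_s^T,
# backward state S_b(t) = sum_{s>=t} k_s v_s^T; the diagonal term
# k_t v_t^T appears in both scans, so subtract it once.
states_f, states_b = [], []
S_f, S_b = np.zeros((d, d)), np.zeros((d, d))
for t in range(T):
    S_f += np.outer(K[t], V[t])
    states_f.append(S_f.copy())
for t in reversed(range(T)):
    S_b += np.outer(K[t], V[t])
    states_b.append(S_b.copy())
states_b.reverse()

O_rnn = np.zeros((T, d))
for t in range(T):
    O_rnn[t] = Q[t] @ (states_f[t] + states_b[t] - np.outer(K[t], V[t]))

assert np.allclose(O_attn, O_rnn)  # the two representations agree
```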
RNNs used in computational neuroscience lie on manifolds whose geometry provides insights into their computations.
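A minimal sketch of the kind of analysis this line of work builds on, assuming PCA as the (standard, but here illustrative) tool: collect hidden-state trajectories from an RNN and check how much variance a low-dimensional subspace captures, a first proxy for the dimensionality of the underlying manifold.

```python
import numpy as np

rng = np.random.default_rng(2)
d_h, d_in, T, trials = 64, 2, 200, 20

# Random vanilla RNN (tanh); recurrent weights scaled for stability.
W = rng.standard_normal((d_h, d_h)) / np.sqrt(d_h)
U = rng.standard_normal((d_h, d_in))

H = []
for _ in range(trials):
    phase = rng.uniform(0, 2 * np.pi)
    h = np.zeros(d_h)
    for t in range(T):
        x = np.array([np.sin(0.1 * t + phase), np.cos(0.1 * t + phase)])
        h = np.tanh(W @ h + U @ x)
        H.append(h)
H = np.array(H)

# PCA via SVD of centered states: the spectrum shows how few linear
# dimensions capture the trajectories (a proxy for manifold dimension).
H -= H.mean(axis=0)
s = np.linalg.svd(H, compute_uv=False)
var = s**2 / np.sum(s**2)
print("variance in top 3 PCs:", var[:3].sum())
```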
A sparse attention mechanism balances efficiency, long-range random-access flexibility, and length-generalization ability.
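A minimal sketch of one common way to combine these properties, assuming a sliding-window plus global-token mask (the paper's specific mechanism is not reproduced): local windows give efficiency and length generalization, while a few global positions retain long-range random access.

```python
import numpy as np

def sparse_attention(Q, K, V, window=4, n_global=2):
    """Softmax attention under a sliding-window + global-token mask.

    Each query attends to positions within `window` of itself (local,
    O(T*window) nonzeros) and to the first `n_global` positions, which
    serve as long-range access points. Illustrative only.
    """
    T, d = Q.shape
    i = np.arange(T)[:, None]
    j = np.arange(T)[None, :]
    mask = (np.abs(i - j) <= window) | (j < n_global)
    scores = (Q @ K.T) / np.sqrt(d)
    scores = np.where(mask, scores, -np.inf)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V

rng = np.random.default_rng(3)
T, d = 12, 4
Q, K, V = (rng.standard_normal((T, d)) for _ in range(3))
print(sparse_attention(Q, K, V).shape)  # (12, 4)
```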
We introduce TiledFlashLinearAttention, a faster kernel algorithm for linear RNNs and mLSTMs via improved sequence parallelism.
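A minimal sketch of the chunkwise-parallel math such kernels build on, assuming plain unnormalized causal linear attention (the mLSTM adds gating, and the actual tiling and sequence parallelism live at the kernel level): a recurrent inter-chunk state plus parallel intra-chunk attention.

```python
import numpy as np

def chunkwise_linear_attention(Q, K, V, chunk=4):
    """Causal linear attention computed chunk by chunk.

    Inter-chunk: a running state S = sum of k_s v_s^T over past chunks
    (sequential). Intra-chunk: causally masked attention within the
    chunk (parallel, matmul-friendly) -- the part tiled kernels speed up.
    """
    T, d = Q.shape
    O = np.zeros((T, d))
    S = np.zeros((d, d))
    causal = np.tril(np.ones((chunk, chunk)))
    for start in range(0, T, chunk):
        q, k, v = (M[start:start + chunk] for M in (Q, K, V))
        n = len(q)
        O[start:start + n] = q @ S + ((q @ k.T) * causal[:n, :n]) @ v
        S += k.T @ v
    return O

rng = np.random.default_rng(4)
T, d = 16, 4
Q, K, V = (rng.standard_normal((T, d)) for _ in range(3))

# Reference: fully sequential causal linear attention.
ref = np.array([Q[t] @ sum(np.outer(K[s], V[s]) for s in range(t + 1))
                for t in range(T)])
assert np.allclose(chunkwise_linear_attention(Q, K, V), ref)
```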