Researcher, Fujitsu Limited
1 paper at NeurIPS 2025
We present the first pure Mamba-based architecture for video action detection, achieving Transformer-level performance with significantly reduced computation, inference time and memory costs.