Undergrad student, Tongji University
1 paper at NeurIPS 2025
We propose a causal framework for video temporal grounding that mitigates confounding biases and improves robustness to linguistic variations and irrelevant queries.