Vision-Language Understanding

1 paper across 1 session

Poster Session 2

Wednesday, December 3, 2025 · 4:30 PM → 7:30 PM

CausalVTG: Towards Robust Video Temporal Grounding via Causal Inference

#4807 · Qiyi Wang, Senda Chen, Ying Shen

We propose a causal framework for video temporal grounding that mitigates confounding biases and improves robustness to linguistic variations and irrelevant queries.