3 papers across 3 sessions
We introduce the ActiveVOO framework for active knowledge acquisition to identify, quantify, and prioritize task-relevant information for open-world embodied planning.
One-Shot Adaptive Visual Tokenizer
DualGround mitigates EOS token bias by introducing an additional phrase-aware path for fine-grained video-language alignment.