1 paper across 1 session
Debugging today's chain-of-thought reasoning for video understanding and introducing a novel reward for MLLM post-training that grounds video reasoning in visual evidence.