Jihai Zhang

PhD student, The Chinese University of Hong Kong

1 paper at NeurIPS 2025

Homepage· OpenReview· Semantic Scholar· Google Scholar

Poster Session 3

1 paper

Thursday, December 4, 2025 · 11:00 AM → 2:00 PM

Exhibit Hall C,D,E

Unveiling the Compositional Ability Gap in Vision-Language Reasoning Model

#4701 · Tianle Li, Jihai Zhang, Yongming Rao, Yu Cheng

We introduce ComPABench to evaluate VLM compositional reasoning, showing that existing post-training methods struggle, while enhancing vision-text alignment and using progress rewards improves RL-based compositional ability.