Researcher, xiaohongshu
1 paper at NeurIPS 2025
We propose a novel hybrid of position embedding to improve the length generalization ability of Vision-Language Models.