out-of-context reasoning

1 paper across 1 session

Poster Session 2

Wednesday, December 3, 2025 · 4:30 PM → 7:30 PM

VLMs can Aggregate Scattered Training Patches

#5207 · Zhanhui Zhou, Lingjie Chen, Chao Yang, Chaochao Lu

We show that vision-language models can piece together scattered training image patches, a capability we call visual stitching. This capability enables generalization to unseen images but also introduces new safety risks.