1 paper across 1 session
We construct a reasoning-oriented geo-localization dataset from social media images and apply GRPO-based reinforcement learning to fine-tune large vision-language models, enhancing their location reasoning capabilities.