PhD student, Hong Kong University of Science and Technology
2 papers at NeurIPS 2025
We propose the first large-scale and real-world 360 dataset and a automatic annotate pipeline to reduce the cost of manual annotation.
Exploring and Mitigating hallucination in LMMs towards scene text spotting and understanding