PhD student, UIUC
3 papers at NeurIPS 2025
Enhancing long video understanding via extreme compression by reducing each selected frame to a single token.
AgMMU is a challenging real‑world benchmark for evaluating and advancing vision-language models (VLMs) in the knowledge‑intensive domain of agriculture.