Heng Ji

Full Professor, University of Illinois Urbana-Champaign

5 papers at NeurIPS 2025

Homepage · OpenReview · Semantic Scholar · Google Scholar

Poster Session 1

2 papers

Wednesday, December 3, 2025 · 11:00 AM → 2:00 PM

Exhibit Hall C,D,E

DyMU: Dynamic Merging and Virtual Unmerging for Efficient Variable-Length VLMs

#5012 · Zhenhailong Wang, Senthil Purushwalkam, Caiming Xiong, Silvio Savarese, Heng Ji, Ran Xu

ToolRL: Reward is All Tool Learning Needs

#511 · Cheng Qian, Emre Can Acikgoz, Qi He, Hongru Wang, Xiusi Chen, Dilek Hakkani-Tür, Gokhan Tur, Heng Ji

The paper proposes a principled reward design framework for training LLMs to use tools via reinforcement learning, yielding significant gains over SFT and baseline models in both performance and generalization.

Poster Session 3

2 papers

Thursday, December 4, 2025 · 11:00 AM → 2:00 PM

Exhibit Hall C,D,E

PARTONOMY: Large Multimodal Models with Part-Level Visual Understanding

#4909 Spotlight · Ansel Blume, Jeonghwan Kim, Hyeonjeong Ha, Elen Chatikyan, Xiaomeng Jin, Khanh Duy Nguyen, Nanyun Peng, Kai-Wei Chang, Derek Hoiem, Heng Ji

The paper introduces PARTONOMY, a new benchmark, and PLUM, a segmenting LMM, which together enable fine-grained, part-level visual reasoning by addressing architectural flaws in existing LMMs and set a new standard for grounded multimodal understanding.

Variational Supervised Contrastive Learning

#5509 · Ziwen Wang, Jiajun Fan, Thao Nguyen, Heng Ji, Ge Liu

Variational supervised contrastive learning maximizes a posterior-weighted ELBO, replacing pairwise comparisons with class-level interactions to reach state-of-the-art performance on image classification tasks.

Poster Session 4

1 paper

Thursday, December 4, 2025 · 4:30 PM → 7:30 PM

Exhibit Hall C,D,E

Fire360: A Benchmark for Robust Perception and Episodic Memory in Degraded 360° Firefighting Video

#4606 Spotlight · Aditi Tiwari, Farzaneh Masoud, Dac Trong Nguyen, Jill Kraft, Heng Ji, Klara Nahrstedt

Fire360 is a benchmark of 360° firefighting videos for evaluating vision-language models under real-world degradation, introducing five tasks including Transformed Object Retrieval (TOR) for fire-damaged object matching.