4 papers across 3 sessions
We find that rerouting spurious shortcuts during adapter training enables robust disentanglement in text-to-image generation.
We propose Flow-GRPO, the first method to integrate online RL into flow-matching models, significantly enhancing text-to-image generation performance.
We present a learning framework that aligns text-to-image diffusion models with human preferences via inverse reinforcement learning, balancing offline and online training.