Multimodal Large Language Models (MLLMs); Discrete Flow Matching - NeurIPS 2025

today local_bar

Multimodal Large Language Models (MLLMs); Discrete Flow Matching

1 paper across 1 session

Poster Session 4

Thursday, December 4, 2025 · 4:30 PM → 7:30 PM

Exhibit Hall C,D,E

FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities

#4012 Spotlight · Jin Wang, Yao Lai, Aoxue Li, Shifeng Zhang, Jiacheng Sun, Ning Kang, Chengyue Wu, Zhenguo Li, Ping Luo

a unified multimodal model purely based on discrete flow matching, achieving comparable performance with AR-based MLLMs