Postdoc, University of California, Berkeley
2 papers at NeurIPS 2025
We introduce a diffusion-based video model that predicts egocentric futures from full-body 3D motion, enabling realistic and controllable first-person simulation.
The way we rasterize images into 1D sequences to feed into long-sequence models is suboptimal! We show that orders other than row-major can be better, and provide an RL method to learn the optimal ordering.
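For intuition, here is a minimal sketch (illustrative only, not the paper's method — the paper learns orderings with RL rather than using fixed ones) of how the same image can be flattened into a 1D sequence under different scan orders:

```python
import numpy as np

# A tiny 2x3 "image" of token ids.
img = np.arange(6).reshape(2, 3)   # [[0, 1, 2], [3, 4, 5]]

# Standard row-major raster order.
row_major = img.flatten(order="C")        # [0, 1, 2, 3, 4, 5]

# Column-major order: scan down columns instead of across rows.
col_major = img.flatten(order="F")        # [0, 3, 1, 4, 2, 5]

# "Snake" (boustrophedon) order: reverse every other row so
# consecutive tokens in the sequence stay spatially adjacent.
snake = img.copy()
snake[1::2] = snake[1::2, ::-1]
snake = snake.flatten(order="C")          # [0, 1, 2, 5, 4, 3]
```

Each ordering induces a different notion of "distance" between pixels in the 1D sequence, which is why the choice of scan order can matter for long-sequence models.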