4 papers across 2 sessions
While most RL methods use shallow MLPs (~2–5 layers), we show that scaling contrastive RL (CRL) to 1000 layers can significantly boost performance, from roughly 2× up to 50×, across a diverse suite of robotic tasks.
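Networks this deep typically need residual connections and normalization to remain trainable; the sketch below is purely illustrative of that idea (the function name, widths, and normalization scheme are assumptions, not the paper's exact architecture):

```python
import numpy as np

def deep_residual_mlp(x, depth, width, rng):
    """Hypothetical sketch: residual blocks with normalization are one
    common way to keep very deep (e.g. 1000-layer) MLPs stable; the
    actual CRL architecture may differ."""
    # Input projection to the hidden width.
    h = x @ rng.standard_normal((x.shape[-1], width)) / np.sqrt(x.shape[-1])
    for _ in range(depth):
        w = rng.standard_normal((width, width)) / np.sqrt(width)
        h = h + 0.1 * np.maximum(h @ w, 0.0)  # scaled residual branch (ReLU)
        # Normalize activations so magnitudes stay bounded across 1000 blocks.
        h = h / (np.linalg.norm(h, axis=-1, keepdims=True) + 1e-6)
    return h

rng = np.random.default_rng(0)
out = deep_residual_mlp(np.ones((4, 17)), depth=1000, width=64, rng=rng)
print(out.shape)  # (4, 64)
```

Without the residual skip and normalization, a plain 1000-layer stack would saturate or explode numerically, which is why shallow MLPs have been the default.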
C-MCTD enables diffusion planners to generate plans 10× longer than training examples by systematically stitching together shorter plans through tree search.
We propose a novel value-function learning scheme for hierarchical policies in offline goal-conditioned RL (GCRL).
Fast Monte Carlo Tree Diffusion (Fast-MCTD) achieves up to 100× speedup over MCTD through parallel rollouts and sparse trajectory planning, while maintaining strong performance on complex long-horizon tasks.