Multi-task Reinforcement Learning

3 papers across 2 sessions

Poster Session 1

Wednesday, December 3, 2025 · 11:00 AM → 2:00 PM

Constructing an Optimal Behavior Basis for the Option Keyboard

#411 · Lucas N. Alegre, Ana Bazzan, Andre Barreto, Bruno Silva

A method for constructing an optimal behavior basis for the Option Keyboard, enabling zero-shot identification of optimal solutions for any linear-reward task.

Meta-World+: An Improved, Standardized, RL Benchmark

#403 · Reginald McLean, Evangelos Chatzaroulas, Luc McCutcheon, Frank Röder, Tianhe Yu, Zhanpeng He, K.R. Zentner, Ryan Julian, J Terry, Isaac Woungang, Nariman Farsad, Pablo Samuel Castro

Undocumented versions of Meta-World have clouded algorithmic performance. This work strives to disambiguate Meta-World results from the literature, while also providing insights into benchmark design.

Poster Session 3

1 paper

Thursday, December 4, 2025 · 11:00 AM → 2:00 PM

Exhibit Hall C,D,E

Adaptable Safe Policy Learning from Multi-task Data with Constraint Prioritized Decision Transformer

#208 · Ruiqi Xue, Ziqian Zhang, Lihe Li, Cong Guan, Lei Yuan, Yang Yu

This paper introduces CoPDT, a method of using one unified and adaptable DT model for multi-task (multi-budget or multi-constraint) offline safe RL.