Full Professor, Tsinghua University, Tsinghua University
1 paper at NeurIPS 2025
This work presents DAIL, a semantic aligned reinforcement learning method that improves task discrimination and generalization in language-conditioned problems.