PhD student, Australian National University
1 paper at NeurIPS 2025
We learn offline meta-policies from natural language supervision with contrastive language-decision pre-training, aligning text embeddings to comprehend environment dynamics.