Associate Professor, University of Alberta
1 paper at NeurIPS 2025
A new benchmark for assessing VLM’s capabilities in real-world video game code assurance tasks.