Researcher, Electronic Arts
1 paper at NeurIPS 2025
A new benchmark for assessing VLM’s capabilities in real-world video game code assurance tasks.