PhD student, Nanyang Technological University
2 papers at NeurIPS 2025
We propose EffiBench-X, a multi-language code efficiency benchmark, to address a gap in existing benchmarks, which primarily focus on a single programming language (e.g., Python).
This paper introduces a novel reasoning-based VLM guard model dubbed GuardReasoner-VL.