2 papers across 1 session
We introduce rStar-Coder for training advanced code-reasoning LLMs; our 14B model achieves performance comparable to QwQ-32B.
Current LLM code evaluation is undermined by weak test cases. We propose SAGA, a method that leverages human expertise to generate stronger verifiers, enabling more reliable assessment; we demonstrate it with our new CodeComPass benchmark and TCGCoder-7B model.
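To see why weak test cases make evaluation unreliable, consider a minimal sketch (the function and test values below are hypothetical illustrations, not from either paper): a buggy maximum-subarray solution passes a weak test suite but is exposed by a stronger one that includes an adversarial edge case.

```python
def buggy_max_subarray(nums):
    # Kadane's algorithm with a common bug: initializing at 0 means
    # all-negative inputs wrongly return 0 instead of the largest element.
    best = cur = 0
    for x in nums:
        cur = max(0, cur + x)
        best = max(best, cur)
    return best

# Weak suite: only "easy" inputs with positive sums.
weak_tests = [([1, 2, 3], 6), ([2, -1, 3], 4)]
# Stronger suite adds the all-negative edge case that catches the bug.
strong_tests = weak_tests + [([-5, -2, -9], -2)]

def passes(solution, tests):
    # A solution "passes" a suite if it matches every expected output.
    return all(solution(nums) == expected for nums, expected in tests)

print(passes(buggy_max_subarray, weak_tests))    # True: bug goes undetected
print(passes(buggy_max_subarray, strong_tests))  # False: stronger suite exposes it
```

A benchmark graded only on the weak suite would mark the buggy solution correct, which is exactly the failure mode that stronger, expert-informed verifiers aim to eliminate.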