3 papers across 3 sessions
We combine LLM-synthesized performance-characterizing constraints with fuzzing to uncover difficult-to-find code inefficiencies and generate performance-stressing tests.
GSO: SWE Agents Struggle at Reasoning and Engineering for Software Optimization
We propose a lesson-based framework for multiple LLM agents to collaboratively solve coding problems by learning from each other.