Researcher, OpenAI
3 papers at NeurIPS 2025
We propose a LLM agent framework to automate red teaming
We create a unified benchmark for evaluating secure code generation, vulnerability detection and poc generation
Nemotron-CLIMB automates data mixture optimization for pre-training, improving domain adaptation and outperforming Llama-3.2-1B by 2.0% on general reasoning.