1 paper across 1 session
We introduce PurpCode, the first post-training recipe for training safe code reasoning models using Deliberative Alignment.