1 paper across 1 session
We introduce a method that encourages LMs to leverage their pretrained knowledge during post-training.