3 papers across 2 sessions
We propose a novel self-improvement algorithm to teach language models to perform effective search.