2 papers across 2 sessions
FastSVERL provides a practical and scalable approach for principled, rigourous interpretability in reinforcement learning.
We improve the fine-tuning performance of reasoning LLMs by identifying the critical steps