1 paper across 1 session
A versatile data mixture ratio optimization framework for LLM training that enjoys both theoretical and practical advantages.