Researcher, Cerebras Systems, Inc
1 paper at NeurIPS 2025
We introduce CompleteP, which offers depth-wise HP transfer, FLOP savings when training deep models, and a larger range of compute-efficient width/depth ratios.