2 papers across 2 sessions
Achieving High Accuracy in Distributed Training Even with Aggressive Gradient Compression
Proposes a scalable gradient compression algorithm for data attribution that runs in sub-linear complexity and achieves competitive attribution results.