PhD student, The University of Hong Kong
3 papers at NeurIPS 2025
We introduce ACM, a framework that enhances model merging by incorporating layer-specific merging coefficients based on activation mutual information.
We propose TreeSynth, a tree-guided subspace-based data synthesis approach, achieving superior data diversity, model performance, robust scalability, and data balance efficacy.