Assistant Professor, University of Maryland, College Park
2 papers at NeurIPS 2025
This paper introduces a few-shot, task-aware knowledge distillation method that leverages counterfactual explanations to improve model performance with fewer data samples.
We introduce T-SHIRT, a new data selection method for instruction-tuning LLMs that scores data at the token level and emphasizes robustness.