Assistant Professor, University of Texas, Austin
2 papers at NeurIPS 2025
In the paper, we study the convergence rates of the maximum likelihood estimator of the gating and prompt parameters in the softmax-contaminated mixture of experts (MoE).
A new learning scheme for the multi-modal LLM LLaVA