PhD student, Massachusetts Institute of Technology
1 paper at NeurIPS 2025
We present JetLM, a new family of LMs, which matches leading full-attention models while significantly improving generation throughput.