1 paper across 1 session
Substantially faster diffusion LLMs using a small auxiliary autoregressive model