2 papers across 2 sessions
new decoding method for diffusion LLM, alternative of semi-AR
Statistical prediction may be sufficient to drive the emergence of internal causal models and causal inference capacities in deep neural networks.