PhD student, Massachusetts Institute of Technology
2 papers at NeurIPS 2025
The paper proves that a two-layer, single-head transformer can reliably perform in-context learning on any-order Markov chains.
We built a generalist clinical foundation model across both time series and vision modalities with a novel RL training algorithm named DRPO.