Associate Professor, Seoul National University
1 paper at NeurIPS 2025
HiFC swaps LLM KV caches directly between the GPU and a pSLC SSD, delivering DRAM-level throughput while eliminating DRAM and cutting cost fivefold.