today local_bar

Dongsuk Jeon

Associate Professor, Seoul National University

1 paper at NeurIPS 2025

Homepage· OpenReview· Semantic Scholar· Google Scholar

Poster Session 1

Wednesday, December 3, 2025 · 11:00 AM → 2:00 PM

Exhibit Hall C,D,E

HiFC: High-efficiency Flash-based KV Cache Swapping for Scaling LLM Inference

#4204 · Inho Jeong, Sunghyeon Woo, Sol Namkung, Dongsuk Jeon

HiFC swaps LLM KV caches directly between GPU and pSLC-SSD, matching DRAM-level throughput while eliminating DRAM and slashing cost five-fold.