2 papers across 2 sessions
HiFC swaps LLM KV caches directly between GPU and pSLC-SSD, matching DRAM-level throughput while eliminating DRAM and slashing cost five-fold.
A benchmark dataset of oil and gas video ads