NAVER Cloud - NeurIPS 2025

🏛 NAVER Cloud

3 papers across 1 session

Poster Session 1

Wednesday, December 3, 2025 · 11:00 AM → 2:00 PM

HiFC: High-efficiency Flash-based KV Cache Swapping for Scaling LLM Inference

#4204 · Inho Jeong, Sunghyeon Woo, Sol Namkung, Dongsuk Jeon

HiFC swaps LLM KV caches directly between the GPU and a pSLC SSD, matching the throughput of DRAM-based swapping while eliminating DRAM and cutting cost five-fold.

Diffusion Adaptive Text Embedding for Text-to-Image Diffusion Models

#3707 · Byeonghu Na, Minsang Park, Gyuwon Sim, Donghyeok Shin, HeeSun Bae, Mina Kang, Se Jung Kwon, Wanmo Kang, Il-chul Moon

We propose Diffusion Adaptive Text Embedding (DATE), which improves text-to-image diffusion models by dynamically refining text embeddings throughout the diffusion sampling process.

Exploiting Vocabulary Frequency Imbalance in Language Model Pre-training

#4106 · Woojin Chung, Jeonghoon Kim

A larger vocabulary lowers language modeling difficulty by making it easier for models to learn non-i.i.d. patterns in text.