Researcher, Yandex
2 papers at NeurIPS 2025
We propose a parallel generation method for LLMs in which multiple instances synchronize through a shared, dynamically updated attention cache.
Automatically detecting task-specific important tokens to accelerate speculative decoding.