Full Professor, Institute of Science and Technology
5 papers at NeurIPS 2025
We propose a parallel generation method for LLMs in which multiple instances synchronize through a shared, dynamically updated attention cache.
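As a rough illustration of the general idea only (not the paper's implementation), the sketch below has several decoding instances append the key/value vectors they produce to one shared cache that every instance attends over on its next step; the class and function names (SharedKVCache, toy_attention, worker_step) and the interleaved scheduling are placeholders of ours.

```python
# Toy sketch: several generation instances share one append-only attention cache.
import numpy as np

class SharedKVCache:
    """Append-only key/value store shared by all generation instances."""
    def __init__(self, dim):
        self.keys = np.zeros((0, dim))
        self.values = np.zeros((0, dim))

    def append(self, k, v):
        self.keys = np.vstack([self.keys, k[None, :]])
        self.values = np.vstack([self.values, v[None, :]])

def toy_attention(query, cache):
    """Single-head dot-product attention over everything currently in the shared cache."""
    scores = cache.keys @ query / np.sqrt(query.shape[0])
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()
    return weights @ cache.values

def worker_step(cache, dim, rng):
    """One decoding step of one instance: read the shared cache, then extend it."""
    query = rng.standard_normal(dim)
    context = toy_attention(query, cache)
    # Pretend the "generated token" yields new key/value vectors.
    new_k, new_v = rng.standard_normal(dim), context + rng.standard_normal(dim)
    cache.append(new_k, new_v)  # now visible to every other instance

rng = np.random.default_rng(0)
cache = SharedKVCache(dim=8)
cache.append(rng.standard_normal(8), rng.standard_normal(8))  # seed with a prompt token
for step in range(3):
    for worker in range(4):  # 4 parallel instances, interleaved here for simplicity
        worker_step(cache, 8, rng)
print(cache.keys.shape)  # (13, 8): 1 prompt entry + 4 workers * 3 steps
```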
Influence Distillation is a mathematically justified data selection method for LLM fine-tuning that assigns optimal weights to training samples, achieving performance on par with or better than the state of the art while being substantially faster.
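For intuition only, here is a toy sketch of gradient-alignment-based sample weighting, the general flavor of influence-style data selection; the linear-regression stand-in model, the clipped inner-product weighting rule, and all variable names are illustrative assumptions of ours, not the formulation from the paper.

```python
# Toy sketch: weight each training sample by how well its gradient aligns
# with the gradient of a small target set.
import numpy as np

rng = np.random.default_rng(0)
n_train, n_target, d = 200, 20, 16

# Toy linear-regression stand-in: parameters w, squared-error loss.
w = rng.standard_normal(d) * 0.1
X_train, y_train = rng.standard_normal((n_train, d)), rng.standard_normal(n_train)
X_target, y_target = rng.standard_normal((n_target, d)), rng.standard_normal(n_target)

def per_sample_grads(X, y, w):
    # For loss_i = 0.5 * (x_i . w - y_i)^2, grad_i = (x_i . w - y_i) * x_i
    residual = X @ w - y
    return residual[:, None] * X                      # shape (n, d)

g_train = per_sample_grads(X_train, y_train, w)                   # one gradient per sample
g_target = per_sample_grads(X_target, y_target, w).mean(axis=0)   # target-set gradient

# Samples whose gradients point "toward" the target objective get larger weights.
alignment = g_train @ g_target
weights = np.clip(alignment, 0.0, None)
weights = weights / weights.sum() if weights.sum() > 0 else np.full(n_train, 1.0 / n_train)

top = np.argsort(weights)[-5:][::-1]
print("highest-weight samples:", top, weights[top].round(4))
```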
We provide a method for accurate end-to-end FP4 training of Large Language Models.
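As a rough sketch of one ingredient of low-precision training, the snippet below maps a tensor onto the 4-bit E2M1 (FP4) grid with a per-tensor scale; the scaling granularity, rounding rule, and function names are illustrative choices of ours, not the training recipe from the paper.

```python
# Toy sketch: round a tensor to the nearest value on the FP4 (E2M1) grid.
import numpy as np

# Representable magnitudes of the E2M1 FP4 format (plus sign).
FP4_GRID = np.array([0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0])

def quantize_fp4(x, eps=1e-12):
    """Scale x so its max magnitude maps to 6, then round to the nearest FP4 value."""
    scale = np.max(np.abs(x)) / 6.0 + eps
    scaled = np.abs(x) / scale
    idx = np.argmin(np.abs(scaled[..., None] - FP4_GRID), axis=-1)
    return np.sign(x) * FP4_GRID[idx], scale

def dequantize_fp4(q, scale):
    return q * scale

x = np.random.default_rng(0).standard_normal(8)
q, scale = quantize_fp4(x)
print("original:    ", x.round(3))
print("fp4 (scaled):", dequantize_fp4(q, scale).round(3))
```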
A low-precision scheme for fine-tuning LLMs.
We investigate new scaling laws that predict how LLM performance scales when training over quantized or sparse representations.
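To make the setting concrete, the toy example below evaluates a standard Chinchilla-style loss form with an assumed "effective parameter count" multiplier for quantized or sparse training; the functional form, the multiplier values, and every constant are placeholder choices for illustration, not the fitted laws from the paper.

```python
# Toy sketch: a Chinchilla-style scaling law L(N, D) = E + A/N^alpha + B/D^beta,
# evaluated with an effective parameter count N_eff = eff * N, where eff in (0, 1]
# stands in for the capacity retained at a given precision or sparsity level.
def scaling_law_loss(N, D, eff=1.0,
                     E=1.7, A=400.0, B=400.0, alpha=0.34, beta=0.28):
    n_eff = eff * N
    return E + A / n_eff**alpha + B / D**beta

N, D = 1e9, 2e10  # 1B parameters, 20B tokens (toy numbers)
for label, eff in [("bf16 (eff=1.0)", 1.0),
                   ("4-bit (eff=0.7, assumed)", 0.7),
                   ("2:4 sparse (eff=0.6, assumed)", 0.6)]:
    print(f"{label}: predicted loss = {scaling_law_loss(N, D, eff):.3f}")
```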