3 papers across 3 sessions
We show that highly accurate LLMs can be trained entirely on synthetic and weakly curated data.
We propose the Anchored Diffusion Language Model (ADLM), a novel two-stage framework that first predicts a distribution over important (anchor) tokens and then uses it to guide the likelihood prediction of the missing tokens, resulting in better likelihood modeling and generated-text quality.
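To make the two-stage idea concrete, here is a minimal decoding sketch: commit a few confident "anchor" tokens first, then iteratively unmask the rest conditioned on them. The callables `anchor_model` and `denoiser` and the confidence-based unmasking heuristic are hypothetical placeholders, not the paper's actual interface.

```python
import torch

def anchored_generate(anchor_model, denoiser, seq_len, mask_id, num_steps=8):
    """Two-stage decoding sketch: fix anchor tokens first, then fill the rest."""
    seq = torch.full((1, seq_len), mask_id, dtype=torch.long)

    # Stage 1: predict anchors and commit the most confident positions.
    probs = anchor_model(seq).softmax(-1)          # (1, seq_len, vocab) logits -> probs
    conf, tok = probs.max(-1)                      # per-position confidence and token
    anchor_pos = conf.topk(max(1, seq_len // 8), dim=-1).indices
    seq.scatter_(1, anchor_pos, tok.gather(1, anchor_pos))

    # Stage 2: iteratively unmask remaining tokens, always conditioning
    # on the anchors fixed in stage 1.
    for _ in range(num_steps):
        probs = denoiser(seq).softmax(-1)
        conf, tok = probs.max(-1)
        masked = seq == mask_id
        if not masked.any():
            break
        conf = conf.masked_fill(~masked, -1.0)     # only reveal still-masked slots
        k = max(1, int(masked.sum()) // num_steps)
        reveal = conf.topk(k, dim=-1).indices
        seq.scatter_(1, reveal, tok.gather(1, reveal))
    return seq
```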
We propose CADMorph, an inference-time editing method for parametric CAD models that uses geometry-change signals to drive a Plan–Generate–Verify loop over pretrained priors, namely an LDM and an LLM, bypassing the need for editing data, which does not exist.
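The loop structure can be sketched as follows, assuming hypothetical wrappers `llm_plan` (LLM prior), `param_generator` (LDM prior), and `geometry_distance` (the verification signal); these names are illustrative, not CADMorph's actual API.

```python
def edit_cad(source_params, target_geometry, llm_plan, param_generator,
             geometry_distance, max_rounds=5, tol=1e-3):
    """Plan-Generate-Verify sketch: iterate until the edited geometry matches."""
    best_params, best_err = source_params, float("inf")
    feedback = None
    for _ in range(max_rounds):
        # Plan: ask the LLM prior which parameters to change and how.
        plan = llm_plan(source_params, target_geometry, feedback)
        # Generate: let the latent-diffusion prior propose edited parameters.
        candidate = param_generator(source_params, plan)
        # Verify: measure how close the edited model's geometry is to the target.
        err = geometry_distance(candidate, target_geometry)
        if err < best_err:
            best_params, best_err = candidate, err
        if err < tol:
            break
        feedback = {"error": err, "plan": plan}   # feed the verifier signal back
    return best_params
```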