residual stream

2 papers across 2 sessions

Poster Session 2

Wednesday, December 3, 2025 · 4:30 PM → 7:30 PM

Do Language Models Use Their Depth Efficiently?

#4011 · Róbert Csordás, Christopher D Manning, Chris Potts

We analyze the effective depth of LLMs and find that they are unlikely to compose subresults, and deeper models spread out the same type of computation as the shallow ones.

Poster Session 5

1 paper

Friday, December 5, 2025 · 11:00 AM → 2:00 PM

Exhibit Hall C,D,E

Residual Stream Analysis of Overfitting And Structural Disruptions

#1200 · Quan Liu, Han Zhou, Wenquan Wu, Hua Wu, Sen Su