1 paper across 1 session
We propose a new method for interpretating transformer circuit by performing SVD on query-value and value-output matrices