1 paper across 1 session
Small KL-divergence fails to ensure similarity of representations; we propose a distance which does and demonstrate it empirically.