1 paper across 1 session
We provide the first proof showing that pause tokens (such as "...") appended to the input of a Transformer can strictly increase its expressivity.