2 papers across 2 sessions
We develop an adaptive image tokenizer that compresses each image into variable-sized latent features based on its content complexity.