1 paper across 1 session
We propose SignViP, which incorporates multiple fine-grained conditions with a discrete tokenization paradigm for improved generation fidelity.