2 papers across 2 sessions
A new benchmark for evaluating visual captioning in MLLMs that considers both correctness and thoroughness.
Proposed PMQ-VE, a new quantization method for video enhancement that achieves high performance with low-bit models.