1 paper across 1 session
We introduce MGAudio, a flow-based framework for video-to-audio generation that leverages model-guided dual-role alignment to achieve state-of-the-art performance.