2 papers across 2 sessions
We introduce Streaming Flow Matching, a novel streaming generative model for real-time audio generation from discrete tokens.
This paper proposes DCKD, a privileged knowledge distillation framework for target sound extraction that regulates the amount and flow of target information via a neural codec and disentangled representation learning.