4 papers across 2 sessions
A novel framework that combines visual priors and dynamic constraints within a synchronized diffusion process for HOI video and motion generation.
MEgoHand is the starting point for generating high-quality motion sequences of hand-object interactions, conditioned on egocentric RGB images, textual instructions, and given initial MANO hand parameters.
We propose HACO, a framework for dense hand contact estimation that addresses class and spatial imbalance challenges in training on large-scale datasets.