Research Lead, ByteDance
2 papers at NeurIPS 2025
Given an input mesh, Puppeteer first transforms it into an animation-ready model through automatic rigging, and subsequently animates it under video guidance.
We propose SuperCLIP, a simple and efficient extension to CLIP that adds classfication-based supervision to improve fine-grained image-text alignment without requiring extra annotations or significant computation.