Full Professor, Princeton University
3 papers at NeurIPS 2025
Our structured dataset allows us to analyze how model vision compares to human perception and to determine whether VLMs perform similar visual reasoning algorithms as humans can.
InFlux is the first real-world benchmark that provides per-frame ground truth camera intrinsics for videos with dynamic intrinsics, and current baselines struggle to predict accurate intrinsics on our benchmark.