4 papers across 3 sessions
A reliable driving world model that can be accurately steered with various actions with realistic visual simulation.
We present Pixel-Perfect Depth, a monocular depth estimation model based on pixel-space diffusion transformers.