4 papers across 3 sessions
We present 3EED, the first large-scale benchmark for 3D visual grounding across vehicles, drones, and quadrupeds, with over 134K 3D objects and 25K human-verified expressions in diverse outdoor scenes.
We present a hierarchical representation learning method using hyperbolic spaces for Neural Radiance Field.