3 papers across 3 sessions
We introduce Early Vision Networks (EVNets), hybrid CNNs combining biologically inspired subcortical and cortical front-end blocks, which improve V1 alignment and significantly boost robustness to different image perturbations.
This paper presents InstructSAM, a training-free framework that counts, detects, and segments objects in remote sensing imagery based on tailored instructions.