2 papers across 2 sessions
we release a cognitively-inspired benchmark for reasoning across scenes that reveals hallucination is an open challenge for multimodal models
A systematic fixation-level comparison of a performance-optimized DNN scanpath model and a mechanistic cognitive reveals behaviourally relevant mechanisms that can be added to the mechanistic model to substantially improve performance