A Self-Supervised Deep Neural Network for Image Completion Resembles Early Visual Cortex fMRI Activity Patterns for Occluded Scenes

Svanera, Michele; Morgan, Andrew; Petro, Lucy S.; Muckli, Lars

doi:10.1101/2020.03.24.005132

Cited by 1 publication

(1 citation statement)

References 62 publications

“…The generative nature of this method makes it suitable for modeling different top-down modulations and feedback processing. To date, these models have been used to study effects of top-down feedback in ventral pathway [76, 2] and to model predictive coding [37], mental imagery [7] and continual learning [83]. More generally, these models can also be used for representation learning, where they can be trained using self-supervised methods to generate the visual input.…”

Section: Discussion (mentioning)

confidence: 99%

A brain-inspired object-based attention network for multi-object recognition and visual reasoning

Adeli; Ahn; Zelinsky

2022, Preprint

The visual system uses sequences of selective glimpses to objects to support behavioral goals, but how is this attention control learned? Here we present an encoder-decoder model inspired by the interacting bottom-up and top-down visual pathways making up the recognition-attention system in the brain. At every iteration, a new glimpse is taken from the image and is processed through the 'what' encoder, a hierarchy of feedforward, recurrent, and capsule layers, to obtain an object-centric (object-file) representation. This representation feeds to the 'where' decoder, where the evolving recurrent representation provides top-down attentional modulation to plan subsequent glimpses and impact routing in the encoder. We demonstrate how the attention mechanism significantly improves the accuracy of classifying highly overlapping digits. In a visual reasoning task requiring comparison of two objects, our model achieves near-perfect accuracy and significantly outperforms larger models in generalizing to unseen stimuli. Our work demonstrates the benefits of object-based attention mechanisms taking sequential glimpses of objects.