2018
DOI: 10.1101/240614
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Large-scale, high-resolution comparison of the core visual object recognition behavior of humans, monkeys, and state-of-the-art deep artificial neural networks

Abstract: 242 words 6Significance Statement 97 words 2 7

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

12
197
1

Year Published

2018
2018
2021
2021

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 130 publications
(210 citation statements)
references
References 39 publications
12
197
1
Order By: Relevance
“…Analogous to the brain's sensory network, DNNs perform complex computations through deep stacks of simple intra‐layer neural circuits. Thus, researchers have widely used DNN models to understand the human brain network, especially sensory brain networks (Eickenberg, Gramfort, Varoquaux, & Thirion, ; Guclu & van Gerven, ; Horikawa & Kamitani, ; Rajalingham et al, ; Yamins & DiCarlo, ). At the same time, DNNs are capable of discovering complex structures within high‐dimensional input data, and can transform these structures into abstract levels (LeCun et al, ).…”
Section: Discussionmentioning
confidence: 99%
“…Analogous to the brain's sensory network, DNNs perform complex computations through deep stacks of simple intra‐layer neural circuits. Thus, researchers have widely used DNN models to understand the human brain network, especially sensory brain networks (Eickenberg, Gramfort, Varoquaux, & Thirion, ; Guclu & van Gerven, ; Horikawa & Kamitani, ; Rajalingham et al, ; Yamins & DiCarlo, ). At the same time, DNNs are capable of discovering complex structures within high‐dimensional input data, and can transform these structures into abstract levels (LeCun et al, ).…”
Section: Discussionmentioning
confidence: 99%
“…For the macaque face-processing network, our study here could not find a CNN layer corresponding to the intermediate stage (ML). In addition, some recent studies have pointed out potential representational discrepancies between CNN and the ventral stream from behavioral consideration 19,23,[27][28][29] . (See also the related discussion on the 'computational gap' below.)…”
Section: Discussionmentioning
confidence: 99%
“…Although these network models are not constrained by experimental data, they have nonetheless been shown to provide an even better fit than earlier models to both behavioral 18,24,25 and electrophysiological 26,27 data (but see Ref. 28). These network architectures now achieve accuracy well beyond those of earlier computational models of the VC and are on par with or better than human accuracy during unspeeded image categorization tasks for both object 29 and face 30 recognition.…”
Section: Introductionmentioning
confidence: 99%