2012
DOI: 10.1016/j.neuron.2012.01.010
|View full text |Cite
|
Sign up to set email alerts
|

How Does the Brain Solve Visual Object Recognition?

Abstract: Mounting evidence suggests that “core object recognition,” the ability to rapidly recognize objects despite substantial appearance variation, is solved in the brain via a cascade of reflexive, largely feedforward computations that culminate in a powerful neuronal representation in the inferior temporal cortex. However, the algorithm that produces this solution remains little-understood. Here we review evidence ranging from individual neurons, to neuronal populations, to behavior, to computational models. We pr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

83
1,278
6
4

Year Published

2015
2015
2024
2024

Publication Types

Select...
5
5

Relationship

0
10

Authors

Journals

citations
Cited by 1,497 publications
(1,441 citation statements)
references
References 187 publications
83
1,278
6
4
Order By: Relevance
“…While the visual cortex of BNNs may use quite different learning algorithms, its objective function to be minimised may be quite similar to the one of visual ANNs. In fact, results obtained with relatively deep artificial DBNs (Lee et al, 2007b) and CNNs (Yamins et al, 2013) seem compatible with insights about the visual pathway in the primate cerebral cortex, which has been studied for many decades (e.g., Hubel and Wiesel, 1968;Perrett et al, 1982;Desimone et al, 1984;Felleman and Van Essen, 1991;Perrett et al, 1992;Kobatake and Tanaka, 1994;Logothetis et al, 1995;Bichot et al, 2005;Hung et al, 2005;Lennie and Movshon, 2005;Connor et al, 2007;Kriegeskorte et al, 2008;DiCarlo et al, 2012); compare a computer vision-oriented survey (Kruger et al, 2013).…”
Section: Consequences For Neurosciencementioning
confidence: 81%
“…While the visual cortex of BNNs may use quite different learning algorithms, its objective function to be minimised may be quite similar to the one of visual ANNs. In fact, results obtained with relatively deep artificial DBNs (Lee et al, 2007b) and CNNs (Yamins et al, 2013) seem compatible with insights about the visual pathway in the primate cerebral cortex, which has been studied for many decades (e.g., Hubel and Wiesel, 1968;Perrett et al, 1982;Desimone et al, 1984;Felleman and Van Essen, 1991;Perrett et al, 1992;Kobatake and Tanaka, 1994;Logothetis et al, 1995;Bichot et al, 2005;Hung et al, 2005;Lennie and Movshon, 2005;Connor et al, 2007;Kriegeskorte et al, 2008;DiCarlo et al, 2012); compare a computer vision-oriented survey (Kruger et al, 2013).…”
Section: Consequences For Neurosciencementioning
confidence: 81%
“…This results in an increase in receptive field (RF) size and concurrently an increase in the specificity of tuning (Zeiler & Fergus, 2014). This increase of receptive field size and tuning specificity traversing the layers resemble the general architecture of feed-forward visual representations in the human brain (DiCarlo, Zoccolan, & Rust, 2012;Lamme & Roelfsema, 2000).…”
mentioning
confidence: 88%
“…First, divisive normalization computes outputs that are sensitive to relevant but not irrelevant stimulus dimensions because in discarding information about mean levels of activity it computes representations that are invariant across changes in that mean (Carandini and Heeger, 2012). Therefore it can make a major contribution to solving core problems of invariant object recognition (DiCarlo, Zoccolan and Rust, 2012). Second, amplifying modulation contributes to invariant object recognition by making it context-sensitive.…”
Section: Perceptual Invariance and Adaptive Modification Of Rf Selectmentioning
confidence: 99%