2000
DOI: 10.1038/81479
|View full text |Cite
|
Sign up to set email alerts
|

Models of object recognition

Abstract: Understanding how biological visual systems recognize objects is one of the ultimate goals in computational neuroscience. From the computational viewpoint of learning, different recognition tasks, such as categorization and identification, are similar, representing different trade-offs between specificity and invariance. Thus, the different tasks do not require different classes of models. We briefly review some recent trends in computational vision and then focus on feedforward, view-based models that are sup… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

9
349
1
1

Year Published

2008
2008
2015
2015

Publication Types

Select...
5
5

Relationship

1
9

Authors

Journals

citations
Cited by 564 publications
(366 citation statements)
references
References 56 publications
(55 reference statements)
9
349
1
1
Order By: Relevance
“…Note that the performance of both models significantly degrades when stimuli are not presented in the center of the receptive field, in marked contrast to the same decoding task applied in the medial superior temporal (MST) area (Mineault et al, 2012), suggesting that computations described in MT represent one element of what is likely a hierarchical computation. That is, the initial selectivity developed in MT is further refined and generalized across spatial positions in higher-level areas such as area MST, as has been suggested for analogous computations in other areas (Riesenhuber and Poggio, 2000).…”
Section: Selectivity and Coding Of Complex Optic Flowmentioning
confidence: 98%
“…Note that the performance of both models significantly degrades when stimuli are not presented in the center of the receptive field, in marked contrast to the same decoding task applied in the medial superior temporal (MST) area (Mineault et al, 2012), suggesting that computations described in MT represent one element of what is likely a hierarchical computation. That is, the initial selectivity developed in MT is further refined and generalized across spatial positions in higher-level areas such as area MST, as has been suggested for analogous computations in other areas (Riesenhuber and Poggio, 2000).…”
Section: Selectivity and Coding Of Complex Optic Flowmentioning
confidence: 98%
“…The response of an S3 unit can be thought of as encoding the similarity of the input to a previously encountered face at a particular viewpoint. The S3 units are analogous to the "view-tuned units" in [5,6].The cells in the C3 layer pool over the S3 cells preferring each viewpoint of the same face. Thus the pattern of activity over all the C3 units is a vector of similarities to previously encountered template faces invariantly of viewpoint.…”
Section: Simulation Proceduresmentioning
confidence: 99%
“…Because familiar real-world objects are more natural stimuli for the visual system, they might have more bound representations than objects that are made up of entirely dissociable low-level features that seem to be stored independently even at the lowest levels of the visual system (e.g., orientation, color, spatial frequency; Magnussen, 2000) and that can be attended separately at encoding (Maunsell & Treue, 2006). Research on object recognition and long-term memory provide some proposals regarding the underlying representations of real-world objects (e.g., Diana, Yonelinas, & Ranganath, 2007;DiCarlo & Cox, 2007;Hummel, 2000;Riesenhuber & Poggio, 2000). In particular, these models typically assume "bound" representations of real-world objects.…”
mentioning
confidence: 99%