2015
DOI: 10.1007/s00422-015-0658-2
|View full text |Cite
|
Sign up to set email alerts
|

Invariant visual object recognition: biologically plausible approaches

Abstract: Key properties of inferior temporal cortex neurons are described, and then, the biological plausibility of two leading approaches to invariant visual object recognition in the ventral visual system is assessed to investigate whether they account for these properties. Experiment 1 shows that VisNet performs object classification with random exemplars comparably to HMAX, except that the final layer C neurons of HMAX have a very non-sparse representation (unlike that in the brain) that provides little information… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
30
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
6
1
1

Relationship

2
6

Authors

Journals

citations
Cited by 31 publications
(34 citation statements)
references
References 120 publications
(391 reference statements)
1
30
0
Order By: Relevance
“…On the other hand, each neuron does respond very much more to some stimuli than to many others, and in this sense is tuned to some stimuli. This type of representation is not found in some models of invariant object representation in the visual cortical areas such as HMAX, but is approximated in VisNet (Robinson & Rolls, 2015). (HMAX C layer neurons are extremely broadly tuned (Robinson & Rolls, 2015).…”
Section: The Speed Of Information Processing In the Temporal Corticalmentioning
confidence: 99%
See 1 more Smart Citation
“…On the other hand, each neuron does respond very much more to some stimuli than to many others, and in this sense is tuned to some stimuli. This type of representation is not found in some models of invariant object representation in the visual cortical areas such as HMAX, but is approximated in VisNet (Robinson & Rolls, 2015). (HMAX C layer neurons are extremely broadly tuned (Robinson & Rolls, 2015).…”
Section: The Speed Of Information Processing In the Temporal Corticalmentioning
confidence: 99%
“…(HMAX C layer neurons are extremely broadly tuned (Robinson & Rolls, 2015). When interpreted by a machine learning tool such as support vector machine learning, the representation that is required by the machine learning is just that specified by the experimenter, which might be for one object, or typically for any example of a single class of object such as hats or bears (Robinson & Rolls, 2015). )…”
Section: The Speed Of Information Processing In the Temporal Corticalmentioning
confidence: 99%
“…CNNs can not be considered realistic models of the ventral stream, because they learn in very different ways, consist of much simpler units, typically lack lateral connections, etc. [11]. However, the fact that they have realistic core object recognition capabilities, and also exhibit internal representations that resemble those in the ventral stream, constrains ideas about the roles of these missing details in vision.…”
Section: Introductionmentioning
confidence: 99%
“…However, the results described here show that even if that were the case, then temporal trace learning would be very helpful in allowing neurons to achieve useful and accurate invariance. However, the concept in VisNet is that temporal trace learning is sufficient to learn transform invariant representations of objects in the ventral visual system, with the great advantage that this mechanism can also account for other forms of invariance, including view‐invariant representations, which cannot be learned by a spatial coordinate transform (as different views of a given object may be completely different), but which VisNet with its temporal short‐term memory trace approach learns well (Bart & Hegde, ; Perry, Rolls, & Stringer, ; Robinson & Rolls, ; Rolls, , ; Rolls & Mills, ; Rolls & Stringer, ; Rolls & Webb, ; Wallis & Rolls, ; Webb & Rolls, ; Zhao et al, ).…”
Section: Discussionmentioning
confidence: 99%