“…Although the complete dimensionality of a single view representation x_i is thus (50+3)×18×18 = 17172, the effective dimensionality is much smaller, due to the sparsity of the representation vector and the confinement of activation to the figure-ground mask. Nevertheless, it is a key feature of our biologically motivated visual processing model that robustness, generalization and speed of learning are not achieved by a dimension reduction, as in most other current online learning models [8,9,3,10,11,12,14]. The key element is a transformation of the input into a sparse, robust feature-map representation that captures relevant locally invariant structures of the objects.…”
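
The gap between the nominal and the effective dimensionality described in the excerpt can be illustrated with a small sketch. Only the sizes (50+3 maps of 18×18) come from the text. The function names, the toy 6×6 mask, and the choice of five active maps are assumptions made for illustration and are not taken from the paper.

```python
# Sizes quoted in the excerpt: (50 + 3) feature maps, each 18 x 18.
N_MAPS = 50 + 3
HEIGHT = WIDTH = 18


def full_dimensionality(n_maps=N_MAPS, height=HEIGHT, width=WIDTH):
    """Nominal length of the view representation vector."""
    return n_maps * height * width


def effective_dimensionality(x, mask):
    """Count nonzero entries of x that lie inside the figure-ground mask.

    x    : list of feature maps, each a HEIGHT x WIDTH nested list
    mask : HEIGHT x WIDTH nested list of 0/1 values
    """
    return sum(
        1
        for fmap in x
        for r, row in enumerate(fmap)
        for c, value in enumerate(row)
        if mask[r][c] and value != 0
    )


# Hypothetical toy data: a centred 6 x 6 mask (assumption, not from the paper).
mask = [
    [1 if 6 <= r < 12 and 6 <= c < 12 else 0 for c in range(WIDTH)]
    for r in range(HEIGHT)
]

# All-zero representation, then activate 5 maps inside the mask only,
# mimicking a sparse response confined to the masked region.
x = [[[0.0] * WIDTH for _ in range(HEIGHT)] for _ in range(N_MAPS)]
for m in range(5):
    for r in range(HEIGHT):
        for c in range(WIDTH):
            if mask[r][c]:
                x[m][r][c] = 1.0

print(full_dimensionality())             # nominal size of the vector
print(effective_dimensionality(x, mask)) # active entries inside the mask
```

In this toy setting only a small fraction of the 17172 entries is active, even though the vector itself is never projected to a lower-dimensional space, which is the distinction the excerpt draws.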