“…In the image domain, several problems have successfully been modelled and solved using strings rather than local feature vectors, e.g. text recognition [9,10,11], shape matching [12,13], image classification [14,15,16,17] and video classification [18]. In this section, we focus on the related work that addresses the question of spatial information and topological relationships within SPR and CNN frameworks for image classification.…”