2011 International Conference on Computer Vision 2011
DOI: 10.1109/iccv.2011.6126283
|View full text |Cite
|
Sign up to set email alerts
|

Recognizing jumbled images: The role of local and global information in image classification

Abstract: The performance of current state-of-the-art computer vision algorithms at image classification falls significantly short as compared to human abilities. To reduce this gap, it is important for the community to know what problems to solve, and not just how to solve them. Towards this goal, via the use of jumbled images, we strip apart two widely investigated aspects: local and global information in images, and identify the performance bottleneck.Interestingly, humans have been shown to reliably recognize jumble… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
16
0

Year Published

2012
2012
2016
2016

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 32 publications
(17 citation statements)
references
References 26 publications
1
16
0
Order By: Relevance
“…This suggests that one of the weaknesses of both the BoB and BoN representations is their lack of explicit encoding of the geometric relationship between different descriptor words. Similar findings have been reported in the context of local descriptor based representations of textured objects [9]. We also investigated the possibility of a simple decision level combination of the two representations.…”
Section: Discussionsupporting
confidence: 57%
“…This suggests that one of the weaknesses of both the BoB and BoN representations is their lack of explicit encoding of the geometric relationship between different descriptor words. Similar findings have been reported in the context of local descriptor based representations of textured objects [9]. We also investigated the possibility of a simple decision level combination of the two representations.…”
Section: Discussionsupporting
confidence: 57%
“…1, 4 th row [3]. Indeed, Parikh [23] showed a a majority-vote accumulation over human classification of the individual blocks is a good predictor of human responses of the entire jumbled images. This dataset contains human performances on 3 image sets: 1) OSR, 384 outdoor scenes from the 8 categories of 8-CAT, 2) ISR, 300 indoor scenes [5] from bathroom, bedroom, dining room, gym, kitchen, living room, theater and staircase categories, and 3) CAL: Caltech objects (50 images from each of 6 categories aeroplane, car-rear, face, ketch, motorbike, and watch).…”
Section: Human Studiesmentioning
confidence: 99%
“…Human performance on jumbled images depends on the level of image blocking [23] (here 65%). Model accuracies (trained and tested on jumbled images) are shown in Fig.…”
Section: Test 4: Local or Global Information: Recognition Of Jumbled mentioning
confidence: 99%
“…In the past years, many techniques have been proposed to address the problem of image classification [1][2][3][4][5][6][7][8] . There are two key assumptions in these algorithmic techniques: the first assumption is that images in the database are usually distributed in the Euclidean space, and the second one is that the dissimilarity-based matching is based on the pairwise measure.…”
Section: Introductionmentioning
confidence: 99%