Making Visual Object Categorization More Challenging: Randomized Caltech-101 Data Set

Kinnunen, Teemu; Kämäräinen, Joni-Kristian; Lensu, Lasse; Lankinen, Jukka; Kälviäinen, Heikki

doi:10.1109/icpr.2010.124

Cited by 26 publications

(13 citation statements)

References 10 publications


“…It is important to observe who knows the ground truth, and why and how. To facilitate these steps in general, a lot of research methods exist: randomized Hough transform (RHT) for geometric primitives detection [4][5][6][7][8][9][10][11][12][13], Gabor filtering for object detection [14][15][16][17][18], Gaussian mixture models for object classification [19,20], SOM- and PCA-based image compression and representation of spectral images [21][22][23], surface analysis for 2D and 3D images [24][25][26], unsupervised methods for visual object categorization (VOC) [27][28][29][30][31], tracking methods for computer vision [32][33][34]. It must be considered whether there are challenges to be expected in imaging and whether multimodal information is needed.…”

Section: From Human Vision To Machine Vision (mentioning)

confidence: 99%

“…Different areas, that is, objects, are considered to affect the quality differently. We have generated the Randomized Caltech-101 image set [28] (with the known ground truth) and the Abstract image set [30], including human opinions as "the ground truth" [46] to test the quality of our approach. […] considered in Figure 11.8 where unsupervised categorization means that object categories are not known beforehand, and thus the most important ones are detected without supervision.…”

Section: Image Quality Assessment and Visual Object Categorization (mentioning)

confidence: 99%


From pattern recognition methods to machine vision applications

Kälviäinen

2015

Advances in Independent Component Analysis and Learning Machines


“…Experiments were performed on the Caltech101 [11] and RandCaltech101 [23] datasets. Caltech101 contains 9144 images, most of them in medium resolution (300×300 pixels).…”

Section: Datasets (mentioning)

confidence: 99%

“…RandCaltech101 is obtained from Caltech101 by randomly modifying the backgrounds and the posture (position, orientation) of objects. It has been shown [23] that classification is more challenging on RandCaltech101 than on Caltech101.…”

Section: Datasets (mentioning)

confidence: 99%

Semantic-enriched visual vocabulary construction in a weakly supervised context

Rizoiu

Velcin

Lallich

2015

IDA


One of the prevalent learning tasks involving images is content-based image classification. This is a difficult task especially because the low-level features used to digitally describe images usually capture little information about the semantics of the images. In this paper, we tackle this difficulty by enriching the semantic content of the image representation by using external knowledge. The underlying hypothesis of our work is that creating a more semantically rich representation for images would yield higher machine learning performances, without the need to modify the learning algorithms themselves. The external semantic information is presented under the form of non-positional image labels, therefore positioning our work in a weakly supervised context. Two approaches are proposed: the first one leverages the labels into the visual vocabulary construction algorithm, the result being dedicated visual vocabularies. The second approach adds a filtering phase as a pre-processing of the vocabulary construction. Known positive and known negative sets are constructed and features that are unlikely to be associated with the objects denoted by the labels are filtered. We apply our proposition to the task of content-based image classification and we show that semantically enriching the image representation yields higher classification performances than the baseline representation.


“…Moreover, some classes have virtually no background or the background remains the same. These problems make the data set bad for comparing alignment methods and for this reason, we also report the results for the recently published randomised Caltech-101 (r-Caltech-101) [15]. In r-Caltech-101 the backgrounds have been replaced with random Google landscape images and the objects transformed to random poses (scale, translation, rotation).…”

Section: Performance Evaluation (mentioning)

confidence: 99%

Local Feature Based Unsupervised Alignment of Object Class Images

Lankinen

Kämäräinen

2011

Proceedings of the British Machine Vision Conference 2011

Self Cite


Alignment of objects is a predominant problem in part-based methods for visual object categorisation (VOC). These methods should learn the parts and their spatial variation, which is difficult for objects in arbitrary poses. A straightforward solution is to annotate images with a set of "object landmarks", but due to laborious manual annotation, semi-supervised methods requiring only a set of images and class labels are preferred. Recent state-of-the-art VOC methods utilise various approaches to align objects or otherwise compensate their geometric variation, but no explicit solution to the alignment problem with quantitative results can be found. The problem has been studied in the recent works related to "image congealing". The congealing methods, however, are based on image-based processing, and thus require moderate initial alignment and are sensitive to intra-class variation and background clutter. In this work, we define a local feature based algorithm to rigidly align object class images. Our algorithm is based on the standard VOC tools: local feature detectors and descriptors, correspondence based homography estimation, and random sample consensus (RANSAC) based spatial validation of local features. We first demonstrate how an intuitive feature matching approach works for simple classes, but fails for more complex ones. This is solved by a spatial scoring procedure which is the core element in the proposed method. Our method is compared to a state-of-the-art congealing method with realistic and difficult Caltech-101 and randomised Caltech-101 (r-Caltech-101) categories for which our method achieves clearly superior performance.
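The citation statement above describes r-Caltech-101 objects as being transformed to random poses (scale, translation, rotation). As a rough illustration only, the sketch below samples such a pose as a 2D similarity transform and applies it to object outline points. The function names, the scale range, and the shift limit are hypothetical assumptions and are not taken from the data set paper; background replacement is not covered.

```python
import math
import random


def random_similarity(rng, scale_range=(0.5, 1.5), max_shift=50.0):
    """Sample a random 2D similarity transform as a 3x3 homogeneous matrix.

    The ranges are illustrative assumptions, not values from the paper.
    """
    s = rng.uniform(*scale_range)            # isotropic scale factor
    theta = rng.uniform(0.0, 2.0 * math.pi)  # rotation angle in radians
    tx = rng.uniform(-max_shift, max_shift)  # horizontal translation
    ty = rng.uniform(-max_shift, max_shift)  # vertical translation
    c, n = s * math.cos(theta), s * math.sin(theta)
    return [[c, -n, tx],
            [n, c, ty],
            [0.0, 0.0, 1.0]]


def apply_transform(matrix, points):
    """Apply a 3x3 homogeneous transform to a list of (x, y) points."""
    (a, b, tx), (c, d, ty), _ = matrix
    return [(a * x + b * y + tx, c * x + d * y + ty) for x, y in points]


# Example: move the corners of a hypothetical 100x80 object bounding box.
rng = random.Random(0)
box = [(0.0, 0.0), (100.0, 0.0), (100.0, 80.0), (0.0, 80.0)]
pose = random_similarity(rng)
moved_box = apply_transform(pose, box)
```

Because the transform is a similarity, distances between points are multiplied by the sampled scale factor and angles are preserved, so the object shape itself is not distorted.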
