Proceedings of the 20th ACM International Conference on Multimedia 2012
DOI: 10.1145/2393347.2396367
|View full text |Cite
|
Sign up to set email alerts
|

Efficient mobile landmark recognition based on saliency-aware scalable vocabulary tree

Abstract: In recent years, the Scalable Vocabulary Tree (SVT) has been shown to be effective in image recognition. However, in mobile landmark image recognition where the foreground is the landmark to be recognized while the background is cluttered, the current SVT framework ignores different local importance of image, hence restricting its performance. In this paper, we propose a new landmark recognition framework that can incorporate saliency information to improve the recognition performance relative to the baseline … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
21
0

Year Published

2014
2014
2017
2017

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 12 publications
(21 citation statements)
references
References 7 publications
0
21
0
Order By: Relevance
“…We propose a re-ranking method to improve the performance of mobile landmark recognition systems that incorporate the saliency information and the SVT image recognition [2].…”
Section: Proposed Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…We propose a re-ranking method to improve the performance of mobile landmark recognition systems that incorporate the saliency information and the SVT image recognition [2].…”
Section: Proposed Methodsmentioning
confidence: 99%
“…Graph-Based Visual Saliency (GBVS) [9,2] is one representative process in visual saliency modeling. It consists of two steps: first, forming an activation map A on the certain feature map F, and then normalizing it in a way which highlights conspicuity locations of the saliency.…”
Section: Proposed Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…SVT approaches can be regarded as an extension of Bagof-Words (BoW) approaches since the visual words can be easily extended to tens of thousand at a logarithmic scale [10][11][12][13]. In a typical image retrieval framework [1], a scalable vocabulary tree (SVT) is generated by hierarchically clustering local descriptors.…”
Section: Introductionmentioning
confidence: 99%