2012
DOI: 10.4236/ijis.2012.23008
|View full text |Cite
|
Sign up to set email alerts
|

Combining Generative/Discriminative Learning for Automatic Image Annotation and Retrieval

Abstract: In order to bridge the semantic gap exists in image retrieval, this paper propose an approach combining generative and discriminative learning to accomplish the task of automatic image annotation and retrieval. We firstly present continuous probabilistic latent semantic analysis (PLSA) to model continuous quantity. Furthermore, we propose a hybrid framework which employs continuous PLSA to model visual features of images in generative learning stage and uses ensembles of classifier chains to classify the multi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 28 publications
0
3
0
Order By: Relevance
“…Since there is no standard human pose depth image library, we builds a data set, including common human actions such as running, jumping, lifting, bending, knee flexion, and interaction. Random forest learning algorithm belongs to supervised learning; the data samples are a known category, and these samples need to be tagged [36][37][38][39]. The tagging method is to divide the human body into 11 parts, and the rest is the background; the approximate position of each part of the human body in the depth image is observed, and then, the position is tagged with the corresponding color.…”
Section: Tagging Body Partsmentioning
confidence: 99%
“…Since there is no standard human pose depth image library, we builds a data set, including common human actions such as running, jumping, lifting, bending, knee flexion, and interaction. Random forest learning algorithm belongs to supervised learning; the data samples are a known category, and these samples need to be tagged [36][37][38][39]. The tagging method is to divide the human body into 11 parts, and the rest is the background; the approximate position of each part of the human body in the depth image is observed, and then, the position is tagged with the corresponding color.…”
Section: Tagging Body Partsmentioning
confidence: 99%
“…The computation time is less than LSA. Li et al [19] have designed a hybrid model to overcome the semantic gap in image retrieval and for automatic image annotation. In this framework, continuous probabilistic latent semantic analysis (PLSA) method is used in productive stage of learning to generate visual features of images.…”
Section: Annotation-based Image Retrievalmentioning
confidence: 99%
“…With the development of artificial intelligence technology, multimodal fusion is an inevitable trend, and multimodal fusion can make more effective use of the characteristics of the image. In practice, a large amount of labeled data is difficult to obtain, but GAN can solve this problem very well [ 8 , 9 ]. Therefore, the problem of how to combine GAN and noise reduction came into being.…”
Section: Introductionmentioning
confidence: 99%