2016
DOI: 10.1007/s00521-016-2603-2
|View full text |Cite
|
Sign up to set email alerts
|

Stochastic learning of multi-instance dictionary for earth mover’s distance-based histogram comparison

Abstract: Dictionary plays an important role in multi-instance data representation. It maps bags of instances to histograms. Earth mover's distance (EMD) is the most effective histogram distance metric for the application of multiinstance retrieval. However, up to now, there is no existing multi-instance dictionary learning methods designed for EMD based histogram comparison. To fill this gap, we develop the first EMD-optimal dictionary learning method using stochastic optimization method. In the stochastic learning fra… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
4
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
3
3
1

Relationship

2
5

Authors

Journals

citations
Cited by 16 publications
(5 citation statements)
references
References 49 publications
0
4
0
Order By: Relevance
“…The experiments of the problems of image annotation and image retrieval based on image tag completion over three benchmark data sets show the advantage of the proposed method. In the future, we will extend our work of CNN model to other machine learning problems beside image tag completion, such as computer vision [16,5,30,40,22,38,39], material engineering [32,33], portfolio choices [26,25,27,24,28], and biomedical engineering [4,3,2,1,21,13,10,23,12,31,29,9].…”
Section: Discussionmentioning
confidence: 99%
“…The experiments of the problems of image annotation and image retrieval based on image tag completion over three benchmark data sets show the advantage of the proposed method. In the future, we will extend our work of CNN model to other machine learning problems beside image tag completion, such as computer vision [16,5,30,40,22,38,39], material engineering [32,33], portfolio choices [26,25,27,24,28], and biomedical engineering [4,3,2,1,21,13,10,23,12,31,29,9].…”
Section: Discussionmentioning
confidence: 99%
“…Data distribution similarity measure. The correlation between the generated data distribution and the original data distribution is assessed using the following metrics: Euclidean distance (ED), 35 KL divergence (KLD), 36 Cosine similarity (CS), 37 Earth Mover's Distance (EMD) 38 and Pearson correlation coefficient (PCC). 39 These five metrics are often used to measure the similarity between two data distributions.…”
Section: Case I: Case Western Reserve University Bearing Datasetmentioning
confidence: 99%
“…For example, in the natural language problems, each sentence is treated as a sequence of words, and each word is represented by a word embedding vector. The sequence of word embedding vectors can be represented further by a CNN model for the problem of sematic classification [11,31,9]. Moreover, in computer vision applications, a video is also composed of a sequence of image frames, and we can also extract visual feature vector from each frame.…”
Section: Introductionmentioning
confidence: 99%