Stochastic learning of multi-instance dictionary for earth mover’s distance-based histogram comparison

Fan, Junhong; Liang, Ru‐Ze

doi:10.1007/s00521-016-2603-2

Cited by 16 publications

(5 citation statements)

References 49 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The experiments of the problems of image annotation and image retrieval based on image tag completion over three benchmark data sets show the advantage of the proposed method. In the future, we will extend our work of CNN model to other machine learning problems beside image tag completion, such as computer vision [16,5,30,40,22,38,39], material engineering [32,33], portfolio choices [26,25,27,24,28], and biomedical engineering [4,3,2,1,21,13,10,23,12,31,29,9].…”

Section: Discussionmentioning

confidence: 99%

A Novel Image Tag Completion Method Based on Convolutional Neural Transformation

Geng¹,

Zhang

et al. 2017

Lecture Notes in Computer Science

Self Cite

View full text Add to dashboard Cite

In the problems of image retrieval and annotation, complete textual tag lists of images play critical roles. However, in real-world applications, the image tags are usually incomplete, thus it is important to learn the complete tags for images. In this paper, we study the problem of image tag complete and proposed a novel method for this problem based on a popular image representation method, convolutional neural network (CNN). The method estimates the complete tags from the convolutional filtering outputs of images based on a linear predictor. The CNN parameters, linear predictor, and the complete tags are learned jointly by our method. We build a minimization problem to encourage the consistency between the complete tags and the available incomplete tags, reduce the estimation error, and reduce the model complexity. An iterative algorithm is developed to solve the minimization problem. Experiments over benchmark image data sets show its effectiveness.

show abstract

Section: Discussionmentioning

confidence: 99%

A Novel Image Tag Completion Method Based on Convolutional Neural Transformation

Geng¹,

Zhang

et al. 2017

Lecture Notes in Computer Science

Self Cite

View full text Add to dashboard Cite

show abstract

“…Data distribution similarity measure. The correlation between the generated data distribution and the original data distribution is assessed using the following metrics: Euclidean distance (ED), 35 KL divergence (KLD), 36 Cosine similarity (CS), 37 Earth Mover's Distance (EMD) 38 and Pearson correlation coefficient (PCC). 39 These five metrics are often used to measure the similarity between two data distributions.…”

Section: Case I: Case Western Reserve University Bearing Datasetmentioning

confidence: 99%

Bearing fault diagnosis based on variational autoencoder and non-local block wide kernel convolutional neural network

Jiang,

Guo,

Guo

et al. 2024

Proceedings of the Institution of Mechanical Engineers, Part C:

View full text Add to dashboard Cite

At present, convolutional neural network (CNN) is widely applied to bearing fault diagnosis However, the diagnosis performance will descend under the strong noise condition in the real industrial environment. Therefore, a denoising method named non-local block wide kernel CNN (NLBWCNN) is proposed based on wide convolution kernel and non-local block. Additionally, the data in the mechanical fault state is less than that in the health state in actual industrial production, which leads to the data imbalance problem. However, the fault classifier based on CNN needs a large amount of balanced data to train. Otherwise, it will not be fully trained, and thus its generalization ability will be affected. As a result, a method called VAE-NLBWCNN (variational autoencoder and NLBWCNN) is proposed for diagnosing bearing faults. The method employs variational autoencoder balanced the fault data. And then, the NLBWCNN is utilized to denoise and classify the fault data. The proposed VAE-NLBWCNN method is validated on three bearing datasets. The comparative experiments demonstrate that the proposed method can effectively expand unbalanced data and achieve the best performance in various noise conditions.

show abstract

“…For example, in the natural language problems, each sentence is treated as a sequence of words, and each word is represented by a word embedding vector. The sequence of word embedding vectors can be represented further by a CNN model for the problem of sematic classification [11,31,9]. Moreover, in computer vision applications, a video is also composed of a sequence of image frames, and we can also extract visual feature vector from each frame.…”

Section: Introductionmentioning

confidence: 99%

Cross-model convolutional neural network for multiple modality data representation

Wang

Cui

et al. 2017

Neural Comput & Applic

View full text Add to dashboard Cite

A novel data representation method of convolutional neural network (CNN) is proposed in this paper to represent data of different modalities. We learn a CNN model for the data of each modality to map the data of different modalities to a common space, and regularize the new representations in the common space by a cross-model relevance matrix. We further impose that the class label of data points can also be predicted from the CNN representations in the common space. The learning problem is modeled as a minimization problem, which is solved by an augmented Lagrange method (ALM) with updating rules of Alternating direction method of multipliers (ADMM). The experiments over benchmark of sequence data of multiple modalities show its advantage.

show abstract

Stochastic learning of multi-instance dictionary for earth mover’s distance-based histogram comparison

Cited by 16 publications

References 49 publications

A Novel Image Tag Completion Method Based on Convolutional Neural Transformation

A Novel Image Tag Completion Method Based on Convolutional Neural Transformation

Bearing fault diagnosis based on variational autoencoder and non-local block wide kernel convolutional neural network

Cross-model convolutional neural network for multiple modality data representation

Contact Info

Product

Resources

About