Learning Shared Semantic Space with Correlation Alignment for Cross-Modal Event Retrieval

Yang, Zhenguo; Lin, Zehang; Kang, Peipei; Lv, Jianming; Li, Qing; Wenyin, Liu

doi:10.1145/3374754

Cited by 24 publications

(5 citation statements)

References 30 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The technique is known as hybrid representation learning (HRL) in which stacked restricted Boltzmann machines are used for extracting modality-friendly representation and a multi-modal deep belief network is exploited for extracting modality-mutual representation. Shared semantic space with correlation alignment (S 3 CA) is introduced in [26] for multi-modal data representation. Non-linear correlations of multi-modal data distributions are aligned in deep neural networks constructed for dissimilar data.…”

Section: Deep Learning Based Methodsmentioning

confidence: 99%

Hybrid SOM based cross-modal retrieval exploiting Hebbian learning

Kaur

Malhi

Pannu

2022

Knowledge-Based Systems

View full text Add to dashboard Cite

Section: Deep Learning Based Methodsmentioning

confidence: 99%

Hybrid SOM based cross-modal retrieval exploiting Hebbian learning

Kaur

Malhi

Pannu

2022

Knowledge-Based Systems

View full text Add to dashboard Cite

“…This whole network facilitates effective iterative parameter optimization. In [99], a shared se-mantic space with correlation alignment (S3CA) is proposed for cross-modal data representation. It aligns the non-linear correlations of cross-modal data distribution in deep neural networks made for diversified data.…”

Section: Machine Learning and Deep Learning Based Methodsmentioning

confidence: 99%

Comparative analysis on cross-modal information retrieval: A review

Kaur

Pannu

Malhi

2021

Computer Science Review

View full text Add to dashboard Cite

“…CFDNet [6] proposed to minimize the distance between the characteristic functions of distribution for feature alignment. On similar lines, [40] proposed to learn a latent space by matching the correlation matrix of different domains. All these works, however, require prior knowledge of ground truth labels which may not be available in most scenarios.…”

Section: A Domain Adaptationmentioning

confidence: 99%

Boundary Preserving Twin Energy-Based-Models for Image to Image Translation

Tiwary¹,

Bhattacharyya²,

Ap³

2022

Preprint

View full text Add to dashboard Cite

<p>Domain shift refers to change of distributional characteristics between the training (source) and the testing (target) datasets of a learning task, leading to performance drop. For tasks involving medical images, domain shift may be caused because of several factors such as change in underlying imaging modalities, measuring devices and staining mechanisms. Recent approaches address this issue via generative models based on the principles of adversarial learning albeit they suffer from issues such as difficulty in training and lack of diversity. Motivated by the aforementioned observations, we adapt an alternative class of deep generative models called the Energy Based Models (EBMs) for the task of unpaired image-to-image translation of medical images. Specifically, we propose a novel method called the Boundary Preserving Twin EBMs (BPT-EBM) which employs a pair of EBMs in the latent space of an Auto-Encoder trained on the source data. While one of the EBMs translates the source to the target domain the other does vice-versa along with a novel boundary preserving loss, ensuring translation symmetry and coupling between the domains. We theoretically analyze the proposed method and show that our design leads to better translation between the domains with reduced langevin mixing steps. We demonstrate the efficacy of our method through detailed quantitative and qualitative experiments on image segmentation tasks on three different datasets vis-a-vis state-of-the-art methods. </p>

show abstract

Learning Shared Semantic Space with Correlation Alignment for Cross-Modal Event Retrieval

Cited by 24 publications

References 30 publications

Hybrid SOM based cross-modal retrieval exploiting Hebbian learning

Hybrid SOM based cross-modal retrieval exploiting Hebbian learning

Comparative analysis on cross-modal information retrieval: A review

Boundary Preserving Twin Energy-Based-Models for Image to Image Translation

Contact Info

Product

Resources

About