2020
DOI: 10.1145/3374754
|View full text |Cite
|
Sign up to set email alerts
|

Learning Shared Semantic Space with Correlation Alignment for Cross-Modal Event Retrieval

Abstract: In this paper, we propose to learn shared semantic space with correlation alignment (S 3 CA) for multimodal data representations, which aligns nonlinear correlations of multimodal data distributions in deep neural networks designed for heterogeneous data. In the context of cross-modal (event) retrieval, we design a neural network with convolutional layers and fully-connected layers to extract features for images, including images on Flickr-like social media. Simultaneously, we exploit a fully-connected neural … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5
2
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 24 publications
(5 citation statements)
references
References 30 publications
0
5
0
Order By: Relevance
“…The technique is known as hybrid representation learning (HRL) in which stacked restricted Boltzmann machines are used for extracting modality-friendly representation and a multi-modal deep belief network is exploited for extracting modality-mutual representation. Shared semantic space with correlation alignment (S 3 CA) is introduced in [26] for multi-modal data representation. Non-linear correlations of multi-modal data distributions are aligned in deep neural networks constructed for dissimilar data.…”
Section: Deep Learning Based Methodsmentioning
confidence: 99%
“…The technique is known as hybrid representation learning (HRL) in which stacked restricted Boltzmann machines are used for extracting modality-friendly representation and a multi-modal deep belief network is exploited for extracting modality-mutual representation. Shared semantic space with correlation alignment (S 3 CA) is introduced in [26] for multi-modal data representation. Non-linear correlations of multi-modal data distributions are aligned in deep neural networks constructed for dissimilar data.…”
Section: Deep Learning Based Methodsmentioning
confidence: 99%
“…This whole network facilitates effective iterative parameter optimization. In [99], a shared se-mantic space with correlation alignment (S3CA) is proposed for cross-modal data representation. It aligns the non-linear correlations of cross-modal data distribution in deep neural networks made for diversified data.…”
Section: Machine Learning and Deep Learning Based Methodsmentioning
confidence: 99%
“…CFDNet [6] proposed to minimize the distance between the characteristic functions of distribution for feature alignment. On similar lines, [40] proposed to learn a latent space by matching the correlation matrix of different domains. All these works, however, require prior knowledge of ground truth labels which may not be available in most scenarios.…”
Section: A Domain Adaptationmentioning
confidence: 99%