2021 IEEE International Conference on Image Processing (ICIP) 2021
DOI: 10.1109/icip42928.2021.9506108
|View full text |Cite
|
Sign up to set email alerts
|

Generalized Zero-Shot Learning Using Multimodal Variational Auto-Encoder With Semantic Concepts

Abstract: With the ever-increasing amount of data, the central challenge in multimodal learning involves limitations of labelled samples For the task of classification, techniques such as meta-learning, zero-shot learning, and few-shot learning showcase the ability to learn information about novel classes based on prior knowledge . Recent techniques try to learn a cross-modal mapping between the semantic space and the image space. However, they tend to ignore the local and global semantic knowledge. To overcome this pro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 16 publications
(8 citation statements)
references
References 13 publications
0
6
0
Order By: Relevance
“…Each of the three articles in the biology domain pertained to recognition [397], prediction [398] and identification [399] accordingly. In total, 4 bird species domain relevant papers were identified, where 3 were in classification [45], [400], [401] and 1 was in integration [402]. There were 2 articles each in the signal processing and gender domains.…”
Section: Inclusion Criteriamentioning
confidence: 99%
See 2 more Smart Citations
“…Each of the three articles in the biology domain pertained to recognition [397], prediction [398] and identification [399] accordingly. In total, 4 bird species domain relevant papers were identified, where 3 were in classification [45], [400], [401] and 1 was in integration [402]. There were 2 articles each in the signal processing and gender domains.…”
Section: Inclusion Criteriamentioning
confidence: 99%
“…Of eight articles in parallel, seven pertained to co-training [66], [192], [209], [210], [329], [376], [379] and one related to transfer learning [201]. On the other side, from 25 non-parallel articles, 11 pertained to transfer learning [83], [107], [108], [146], [208], [215], [223], [246], [285], [286], [333], four related to concept grounding [82], [219], [332], [335] and 10 related to zeroshot learning [84], [85], [175], [191], [224], [247], [287], [377], [400], [401]. In hybrid co-learning, two articles were related to bridging [195], [196].…”
Section: G Co-learningmentioning
confidence: 99%
See 1 more Smart Citation
“…A novel multimodal variational auto‐encoder (M‐VAE) algorithm for GZSL was proposed in Bendre et al (2021), which uses two modalities, images, and semantic concepts, as the input of the encoder. The encoder maps the input of these two modalities to a unified latent space, and the decoder reconstructs the latent embedding vector into the corresponding visual features.…”
Section: Representative Algorithmsmentioning
confidence: 99%
“…They achieve state-of-the-art results for solving particular machine learning problems. It is practically impossible to analyze all of them, but a significant number of them, at one step or another, use the classical concatenation of multimodal vectors (Chen et al, 2020 ; Xie et al, 2020 ; Bendre et al, 2021 ), without a deep examination of unique dependencies between them. Nevertheless, there are other models proposing smarter modality aggregation, such as the Contrastive Multimodal Fusion method (Liu et al, 2021 ), showing there is growing interest in the ML community for nontrivial multimodal fusion.…”
Section: State Of the Artmentioning
confidence: 99%