2023
DOI: 10.1049/ipr2.12790
|View full text |Cite
|
Sign up to set email alerts
|

Integration of the latent variable knowledge into deep image captioning with Bayesian modeling

Abstract: Automatic image captioning systems assign one or more sentences to images to describe their visual content. Most of these systems use attention-based deep convolutional neural networks and recurrent neural networks (CNN-RNN-Att). However, they must optimally use latent variables and side information within the image concepts. This study aims to integrate a latent variable into image captioning using CNN-RNN-Att. A Bayesian modeling framework is used for this work. As an instance of a latent variable, High-Leve… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 43 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?