2019 Global Conference for Advancement in Technology (GCAT) 2019
DOI: 10.1109/gcat47503.2019.8978293
|View full text |Cite
|
Sign up to set email alerts
|

Deep Learning based Automatic Image Caption Generation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 21 publications
(5 citation statements)
references
References 3 publications
0
5
0
Order By: Relevance
“…All the models are trained on the same dataset for concrete comparison. [5] Detection and Recognition of Objects in Image Caption Generator System: A Deep Learning Approach, N. K. Kumar, D. Vigneswari, A. Mohan, K. Laxman and J. Yuvaraj. The aim of this paper is to detect, recognize and generate worthwhile captions for a given image using deep learning.…”
Section: Literature Reviewmentioning
confidence: 99%
“…All the models are trained on the same dataset for concrete comparison. [5] Detection and Recognition of Objects in Image Caption Generator System: A Deep Learning Approach, N. K. Kumar, D. Vigneswari, A. Mohan, K. Laxman and J. Yuvaraj. The aim of this paper is to detect, recognize and generate worthwhile captions for a given image using deep learning.…”
Section: Literature Reviewmentioning
confidence: 99%
“…It alters the aim of forecasting the accurate word towards the aim of creating captions that are the same as the ground truth caption. Kesavan et al [10] systematically analyzed distinct deep DNN-based pre-trained models and image caption generation methods to accomplish the effective models by finetuning. The examined model contains with and without 'attention' concepts for optimizing the caption generation capacity.…”
Section: Literature Reviewmentioning
confidence: 99%
“…• Pooling layer: Layer is used when the images are too large. Pooling is done to make a small size of an image [3]. It is done on each dimension of depth independently, so the image depth will remain the same.…”
Section: Image Caption Architecturementioning
confidence: 99%
“…By using the hierarchical structure of LSTM, various aspects can be obtained for different levels of information, and attention can be determined based on seen information or details of language [7]. The losses are produced to check accuracy and to understand the learning parameters of networks [3]. The above structure is used for various scene captioning tasks that is captioning of video and image by using various terms of aspect extraction, structure of networks and losses.…”
Section: Hierarchical Structure Of Lstmmentioning
confidence: 99%
See 1 more Smart Citation