2022
DOI: 10.3390/math10030288
|View full text |Cite
|
Sign up to set email alerts
|

Modeling of Hyperparameter Tuned Deep Learning Model for Automated Image Captioning

Abstract: Image processing remains a hot research topic among research communities due to its applicability in several areas. An important application of image processing is the automatic image captioning technique, which intends to generate a proper description of an image in a natural language automated. Image captioning is a recently developed hot research topic, and it started to receive significant attention in the field of computer vision and natural language processing (NLP). Since image captioning is considered … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 14 publications
(4 citation statements)
references
References 23 publications
(25 reference statements)
0
1
0
Order By: Relevance
“…COCO contains many features: Object segmentation, 330K images (>200K labelled), 1.5 million object samples, 91 stuff categories, 80 object categories, and 5 captions per image. In Table II, the overall image captioning results of the AIC-SSAIDL technique with recent models are made on the Flickr8k dataset [25,26]. The experimental values portray the improvement of the AIC-SSAIDL technique.…”
Section: Resultsmentioning
confidence: 99%
“…COCO contains many features: Object segmentation, 330K images (>200K labelled), 1.5 million object samples, 91 stuff categories, 80 object categories, and 5 captions per image. In Table II, the overall image captioning results of the AIC-SSAIDL technique with recent models are made on the Flickr8k dataset [25,26]. The experimental values portray the improvement of the AIC-SSAIDL technique.…”
Section: Resultsmentioning
confidence: 99%
“…This method computes coupling coefficients between the underlying and output capsules in order to update the attention weights. Omri et al [13] and Zhu and Yan [14] proposed deep learning method to improve results of automated image captioning. In the study of Thangave et al [15], provides a heterogeneous data fusion-based deep learning model for image captioning.…”
Section: Related Workmentioning
confidence: 99%
“…Ref. [42] proposes inserting predefined length vectors to generate effective descriptions of input images, using the bird swarm algorithm (BSA) and long short-term memory (LSTM) models for sentence generation, to enhance image captioning performance. Ref.…”
Section: Multimodal Learningmentioning
confidence: 99%