FPA 2021
DOI: 10.54216/fpa.040202
|View full text |Cite
|
Sign up to set email alerts
|

Image Caption Generation and Comprehensive Comparison of Image Encoders

Abstract: Image caption generation is a stimulating multimodal task. Substantial advancements have been made in thefield of deep learning notably in computer vision and natural language processing. Yet, human-generated captions are still considered better, which makes it a challenging application for interactive machine learning. In this paper, we aim to compare different transfer learning techniques and develop a novel architecture to improve image captioning accuracy. We compute image feature vectors using different s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 8 publications
(6 citation statements)
references
References 32 publications
0
6
0
Order By: Relevance
“…For a common analysis, a maximum of 10 min observations is accounted for in this assessment. Based on this information, the sensitivity, specificity, precision, and negative prediction metrics are compared with ML-ResNet [ 23 ], TSCNN [ 27 ], and Big-ECG [ 17 ] techniques.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…For a common analysis, a maximum of 10 min observations is accounted for in this assessment. Based on this information, the sensitivity, specificity, precision, and negative prediction metrics are compared with ML-ResNet [ 23 ], TSCNN [ 27 ], and Big-ECG [ 17 ] techniques.…”
Section: Discussionmentioning
confidence: 99%
“…The defined model is mostly used for identifying the type of cardiovascular disease. The feature extraction process is done using a convolutional neural network (CNN) and long short-term memory (LSTM) algorithm which find out the optimal features [ 23 ]. The feature extraction process extracts both temporal and special features which are given by ECG signals.…”
Section: Related Workmentioning
confidence: 99%
“…To speed up the program, it is important to convert the downloaded image into a gray image. Gray scale can represent the total area and distribution area of the whole image as well as the color and brightness characteristics [ 16 ].…”
Section: Methodsmentioning
confidence: 99%
“…e purpose of the brown analysis is to examine the points in the digital image where the exposure will change drastically. Significant changes in image properties often indicate significant events and changes in assets [9]. ese include differences in depth, differences in surface orientation, changes in material properties, and changes in environmental descriptions.…”
Section: Methods Of Constructing English Translation Teaching Modementioning
confidence: 99%