2022
DOI: 10.48550/arxiv.2202.05474
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Bench-Marking And Improving Arabic Automatic Image Captioning Through The Use Of Multi-Task Learning Paradigm

Abstract: The continuous increase in the use of social media and the visual content on the internet have accelerated the research in computer vision field in general and the image captioning task in specific. The process of generating a caption that best describes an image is a useful task for various applications such as it can be used in image indexing and as a hearing aid for the visually impaired. In recent years, the image captioning task has witnessed remarkable advances regarding both datasets and architectures, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
0
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(4 citation statements)
references
References 26 publications
(31 reference statements)
0
0
0
Order By: Relevance
“…The image region vector is generated for each object detected and used as input to the final linear classification layer. The rest of the research depends on the CNN features, VggNet [13,15,18], ResNet [16,19,21], and Inception [18].…”
Section: Visual Modelsmentioning
confidence: 99%
See 3 more Smart Citations
“…The image region vector is generated for each object detected and used as input to the final linear classification layer. The rest of the research depends on the CNN features, VggNet [13,15,18], ResNet [16,19,21], and Inception [18].…”
Section: Visual Modelsmentioning
confidence: 99%
“…The experimental results showed promising performance. Al-muzaini et al [13] and ElJundi et al [15] use a single-layer LSTM with 256 memory units, while Za'ter et al [19] use two-layer LSTM with 256 memory units. Also, many other research works use LSTM, such as [14,16,17,21].…”
Section: Language Modelsmentioning
confidence: 99%
See 2 more Smart Citations