A Sequential Guiding Network with Attention for Image Captioning

Daouda, Sow,; Qin, Zengchang; Niasse, Mouhamed; Wan, Tao

doi:10.1109/icassp.2019.8682505

Cited by 2 publications

(1 citation statement)

References 26 publications

(49 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Another way for learning the features is by adding a network for guidance [102]. More similar to [102], Sow et al [103] inserted a network for guidance, but rather than obtaining one vector for guidance, [103] obtained a sequential network for guidance which was able to adjust the guided vectors in the sentence generation process. They also utilized the Luong attention mechanism [104] that is an enhanced style of the attention technique.…”

Section: Guided Attentionmentioning

confidence: 99%

A Survey on Attention-Based Models for Image Captioning

Osman¹,

Shalaby²,

Soliman³

et al. 2023

IJACSA

View full text Add to dashboard Cite

Image captioning task is highly used in many realworld applications. The captioning task is concerned with understanding the image using computer vision methods. Then, natural language processing methods are used to produce a description for the image. Different approaches were proposed to solve this task, and deep learning attention-based models have been proven to be the state-of-the-art. A survey on attentionbased models for image captioning is presented in this paper including new categories that were not included in other survey papers. The attention-based approaches are classified into four main categories, further classified into subcategories. All categories and subcategories of the attention-based approaches are discussed in detail. Furthermore, the state-of-the-art approaches are compared and the accuracy improvements are stated especially in the transformer-based models, and a summary of the benchmark datasets and the main performance metrics is presented.

show abstract

Section: Guided Attentionmentioning

confidence: 99%

A Survey on Attention-Based Models for Image Captioning

Osman¹,

Shalaby²,

Soliman³

et al. 2023

IJACSA

View full text Add to dashboard Cite

show abstract

A Systematic Literature Review on Image Captioning

Staniūtė

Šešok

2019

Applied Sciences

View full text Add to dashboard Cite

Natural language problems have already been investigated for around five years. Recent progress in artificial intelligence (AI) has greatly improved the performance of models. However, the results are still not sufficiently satisfying. Machines cannot imitate human brains and the way they communicate, so it remains an ongoing task. Due to the increasing amount of information on this topic, it is very difficult to keep on track with the newest researches and results achieved in the image captioning field. In this study a comprehensive Systematic Literature Review (SLR) provides a brief overview of improvements in image captioning over the last four years. The main focus of the paper is to explain the most common techniques and the biggest challenges in image captioning and to summarize the results from the newest papers. Inconsistent comparison of results achieved in image captioning was noticed during this study and hence the awareness of incomplete data collection is raised in this paper. Therefore, it is very important to compare results of a newly created model produced with the newest information and not only with the state of the art methods. This SLR is a source of such information for researchers in order for them to be precisely correct on result comparison before publishing new achievements in the image caption generation field.

show abstract

A Sequential Guiding Network with Attention for Image Captioning

Cited by 2 publications

References 26 publications

A Survey on Attention-Based Models for Image Captioning

A Survey on Attention-Based Models for Image Captioning

A Systematic Literature Review on Image Captioning

Contact Info

Product

Resources

About