2021
DOI: 10.1155/2021/5538927
|View full text |Cite
|
Sign up to set email alerts
|

Features to Text: A Comprehensive Survey of Deep Learning on Semantic Segmentation and Image Captioning

Abstract: With the emergence of deep learning, computer vision has witnessed extensive advancement and has seen immense applications in multiple domains. Specifically, image captioning has become an attractive focal direction for most machine learning experts, which includes the prerequisite of object identification, location, and semantic understanding. In this paper, semantic segmentation and image captioning are comprehensively investigated based on traditional and state-of-the-art methodologies. In this survey, we d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
8
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 19 publications
(12 citation statements)
references
References 167 publications
(135 reference statements)
0
8
0
Order By: Relevance
“…Meanwhile, the decoder part uses Bird Swarm Algorithm (BSA) with LSTM technique so as to concentrate on the generation of descriptive sentences. In an earlier study [19], image captioning and semantic segmentation were widely inspected on the basis of advanced and traditional methods. In this study, the researchers detailed about the application of DL in segmentation examination of both 3D and 2D images utilizing FCN and other high-level hierarchical feature extraction methodologies.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Meanwhile, the decoder part uses Bird Swarm Algorithm (BSA) with LSTM technique so as to concentrate on the generation of descriptive sentences. In an earlier study [19], image captioning and semantic segmentation were widely inspected on the basis of advanced and traditional methods. In this study, the researchers detailed about the application of DL in segmentation examination of both 3D and 2D images utilizing FCN and other high-level hierarchical feature extraction methodologies.…”
Section: Literature Reviewmentioning
confidence: 99%
“…The emotional attributes of short text can be minimized, and then the functions extracted from these two channels are folded and input into the classifier that determines the emotion of the text [ 26 , 27 ]. Each channel of the CNN directly affects the original data, and then the subsequent layers of the multilayer CNN will affect the processed data, so the CNN can extract more direct functions from these two channels [ 28 ]. Figure 7 shows the operation process of the model.…”
Section: Model Establishment and Scheme Designmentioning
confidence: 99%
“…There are six traditional image segmentation methods based on the threshold, edge detection, graph theory, region, clustering, and specific theoretical tools. For example, reference [ 12 ] analyzed the principles, advantages, and disadvantages of image semantic segmentation based on traditional methods and deep learning methods and pointed out that deep learning network had better optimization results than traditional methods. Reference [ 13 ] proposed a new image redirection method using semantic segmentation and pixel fusion, which could finely reassign the scaling factor for each region according to the semantic segmentation results, so as to effectively reduce the geometric distortion in the process of image redirection, but the detection efficiency needs to be improved.…”
Section: Introductionmentioning
confidence: 99%