2021
DOI: 10.1007/s11277-021-08211-x
|View full text |Cite
|
Sign up to set email alerts
|

Ensembling of text and images using Deep Convolutional Neural Networks for Intelligent Information Retrieval

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 15 publications
(8 citation statements)
references
References 26 publications
0
8
0
Order By: Relevance
“…(1) VGGNet [23]. It is deep network structure consisting of 3 × 3 small convolutional kernels, commonly used in VGG16 and VGG19.…”
Section: Experimental Analysis and Resultsmentioning
confidence: 99%
“…(1) VGGNet [23]. It is deep network structure consisting of 3 × 3 small convolutional kernels, commonly used in VGG16 and VGG19.…”
Section: Experimental Analysis and Resultsmentioning
confidence: 99%
“…The template and retrieval-based image captions generate the possible description of the images. The advanced image captioning method [12,13] includes the encoder and decoder structure in identifying the description for the images. In addition, the description performance is improved using the attention mechanism [14] and by capturing the relationship information about the objects.…”
Section: Literature Reviewmentioning
confidence: 99%
“…These layers are usually convolution, pooling, and fully connected layers. The convolution layer is the most fundamental element of a CNN architecture and is used for feature extraction and nonlinear processing [25]. Negative values resulting from convolution are eliminated by applying a non-linear process to the resulting image of the weighted sum.…”
Section: Lightweight Convolutional Neural Networkmentioning
confidence: 99%