2022
DOI: 10.1108/el-09-2021-0173
|View full text |Cite
|
Sign up to set email alerts
|

Constructing a mobile visual search framework for Dunhuang murals based on fine-tuned CNN and ontology semantic distance

Abstract: Purpose Dunhuang murals are rich in cultural and artistic value. The purpose of this paper is to construct a novel mobile visual search (MVS) framework for Dunhuang murals, enabling users to efficiently search for similar, relevant and diversified images. Design/methodology/approach The convolutional neural network (CNN) model is fine-tuned in the data set of Dunhuang murals. Image features are extracted through the fine-tuned CNN model, and the similarities between different candidate images and the query i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 51 publications
0
1
0
Order By: Relevance
“…Multiple information sources in the form of two- dimensional images are transmitted to the CNN input layer of different image channels, and it is studied whether using multimodal images as input can improve the segmentation results. Their results show better performance than those using single modal input [9].…”
Section: Convolutional Neural Networkmentioning
confidence: 93%
“…Multiple information sources in the form of two- dimensional images are transmitted to the CNN input layer of different image channels, and it is studied whether using multimodal images as input can improve the segmentation results. Their results show better performance than those using single modal input [9].…”
Section: Convolutional Neural Networkmentioning
confidence: 93%
“…To enable users to effectively search for same, related and diverse Dunhuang mural images, Zeng et al proposed a CNN model tuned in the data set of Dunhuang murals. The finely tuned ResNet152 is the best choice for searching for similar images at the visual feature level, while improving the diversity of search results [13]. Wang et al studied the robustness of graph convolutional network (GCN), and proposed a new "false node attack" to attack GCN by adding malicious fake nodes.…”
Section: Related Workmentioning
confidence: 99%
“…The convolutional operation of the convolutional layer calculates a linear response; the pooling layer pools the feature data after convolution, and the number of training parameters decreases significantly after down sampling; the activation function layer adds non-linear variation. The activation function layer increases the nonlinear variation to update the generalization ability (Zeng et al 2022). As the audio data of musical tones will be interspersed with different levels of noise, the noise will have an impact on the model recognition and detection.…”
Section: Design Of Music Sound Recognition Model Fusing Rnn and Cnnmentioning
confidence: 99%