2022
DOI: 10.1109/tcsvt.2021.3088505
|View full text |Cite
|
Sign up to set email alerts
|

Learning Generalized Spatial-Temporal Deep Feature Representation for No-Reference Video Quality Assessment

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
21
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 64 publications
(22 citation statements)
references
References 59 publications
0
21
0
Order By: Relevance
“…Finally, we compared the generalization performance of our proposed method with some deep learning NR-IQA models. Actually, authors of [68] have tested the deep learning NR-IQA models such as WaDIQaM [69] and SPAQ [70] on VQA databases (KoNViD-1k, LIVE-Qualcomm, and CVD2014) and concluded that their overall performance was not satisfactory due to temporal information being discarded. Furthermore, this article shows that NR-VQA models such as VSFA and CNN-TLVQM present better generalization performance than those deep learning NR-IQA models.…”
Section: ) Traditional Vqa Databasesmentioning
confidence: 99%
“…Finally, we compared the generalization performance of our proposed method with some deep learning NR-IQA models. Actually, authors of [68] have tested the deep learning NR-IQA models such as WaDIQaM [69] and SPAQ [70] on VQA databases (KoNViD-1k, LIVE-Qualcomm, and CVD2014) and concluded that their overall performance was not satisfactory due to temporal information being discarded. Furthermore, this article shows that NR-VQA models such as VSFA and CNN-TLVQM present better generalization performance than those deep learning NR-IQA models.…”
Section: ) Traditional Vqa Databasesmentioning
confidence: 99%
“…Li [12] et al extracted features from the pre-trained image classification neural network ResNet50 to obtain perception characteristics. Chen et al [13] adopted VGG-16 network to learn the frame-level features of videos, and then obtained the Gaussian distributed features through adversarial learning.…”
Section: Related Workmentioning
confidence: 99%
“…In addition, although deep learning can solve this problem, due to the lack of large-scale compressed video quality evaluation databases, the development of datadriven compressed video quality evaluation methods is still not perfect. The content perception features extracted by several methods of transfer learning [12], [13] are all in the spatial domain, which are obviously not enough for videos. To solve the above problems, it is necessary to construct a large compressed video quality evaluation database and propose a data-driven method to learn their spatiotemporal features.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Ying et al [10] demonstrated that features expressive of both combining local patch quality and global video quality can yield significant performance gains. GSTVQA [16] employs a pyramid temporal aggregation of short-term and long-term memory effects to achieve efficient quality prediction. Li et al [17] employs transfer learning on IQA databases, along with a list-wise ranking loss objective to achieve competitive performance.…”
Section: A Nr-vqa Modelsmentioning
confidence: 99%