Franz Götz-Hahn scite author profile

Video quality assessment (VQA) methods focus on particular degradation types, usually artificially induced on a small set of reference videos. Hence, most traditional VQA methods underperform in-the-wild. Deep learning approaches have had limited success due to the small size and diversity of existing VQA datasets, either artificial or authentically distorted. We introduce a new in-the-wild VQA dataset that is substantially larger and more diverse: KonVid-150k. It consists of a coarsely annotated set of 153,841 videos having five quality ratings each, and 1,596 videos with a minimum of 89 ratings each. Additionally, we propose new efficient VQA approaches (MLSP-VQA) relying on multi-level spatially pooled deep features (MLSP). They are exceptionally well suited for training at scale, compared to deep transfer learning approaches. Our best method, MLSP-VQA-FF, improves the Spearman rank-order correlation coefficient (SRCC) performance metric on the commonly used KoNViD-1k in-the-wild benchmark dataset to 0.82. It surpasses the best existing deep-learning model (0.80 SRCC) and handcrafted feature-based method (0.78 SRCC). We further investigate how alternative approaches perform under different levels of label noise and dataset size, showing that MLSP-VQA-FF is the overall best method for videos in-the-wild. Finally, we show that the MLSP-VQA models trained on KonVid-150k set the new state of the art for cross-test performance on KoNViD-1k, LIVE-VQC, and LIVE-Qualcomm with 0.83, 0.75, and 0.64 SRCC, respectively. For both KoNViD-1k and LIVE-VQC this inter-dataset testing outperforms intra-dataset experiments, showing excellent generalization.

INDEX TERMS: datasets, deep transfer learning, multi-level spatially-pooled features, video quality assessment, video quality dataset.
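Since the abstract reports every result as an SRCC value, a minimal sketch of how that metric can be computed may help. It ranks the predicted scores and the subjective scores separately, then takes the Pearson correlation of the two rank lists. The function names and the sample scores are illustrative assumptions, not taken from the paper, and this is not the authors' evaluation code.

```python
def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def srcc(predicted, subjective):
    """Spearman rank-order correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(predicted), average_ranks(subjective))


# Hypothetical model outputs and mean opinion scores for five videos.
predicted = [2.1, 3.4, 1.8, 4.2, 3.0]
subjective = [2.5, 3.1, 2.0, 4.5, 3.3]
print(round(srcc(predicted, subjective), 3))  # 0.9
```

Because only the ordering matters, SRCC rewards a model that ranks videos correctly even when its scores sit on a different scale from the subjective ratings.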

Exploiting Transformer-Based Multitask Learning for the Detection of Media Bias in News Articles

Spinde

Krieger

Ruas

et al. 2022

Quantifying Visual Abstraction Quality for Computer-Generated Illustrations

Spicker

Götz-Hahn

Lindemeier

et al. 2019

ACM Trans. Appl. Percept.

We investigate how the perceived abstraction quality of computer-generated illustrations is related to the number of primitives (points and small lines) used to create them. Since it is difficult to find objective functions that quantify the visual quality of such illustrations, we propose an approach to derive perceptual models from a user study. By gathering comparative data in a crowdsourcing user study and employing a paired comparison model, we can reconstruct absolute quality values. Based on an exemplary study for stippling, we show that it is possible to model the perceived quality of stippled representations based on the properties of an input image. The generalizability of our approach is demonstrated by comparing models for different stippling methods. By showing that our proposed approach also works for small lines, we demonstrate its applicability toward quantifying different representational drawing elements. Our results can be related to Weber-Fechner's law from psychophysics and indicate a logarithmic relationship between number of rendering primitives in an illustration and the perceived abstraction quality thereof.
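The abstract does not name the paired comparison model it uses. As an illustration of how absolute quality values can be recovered from comparative votes, here is a minimal sketch that fits a Bradley-Terry model with the standard minorization-maximization update. The choice of Bradley-Terry, the function name, and the vote counts are all assumptions made for this example.

```python
import math


def bradley_terry(wins, iterations=200):
    """Estimate log-scale quality scores from a pairwise win matrix.

    wins[i][j] is the number of times item i was preferred over item j.
    Assumes every item wins at least once and is compared at least once;
    otherwise its strength collapses to zero and the log is undefined.
    """
    n = len(wins)
    strength = [1.0] * n
    for _ in range(iterations):
        updated = []
        for i in range(n):
            total_wins = sum(wins[i])
            # Comparisons involving item i, weighted by the current strengths.
            denom = sum(
                (wins[i][j] + wins[j][i]) / (strength[i] + strength[j])
                for j in range(n)
                if j != i
            )
            updated.append(total_wins / denom)
        norm = sum(updated)
        strength = [value / norm for value in updated]
    return [math.log(value) for value in strength]


# Hypothetical votes for three renderings of one image, each pair shown 10 times.
wins = [
    [0, 3, 1],
    [7, 0, 4],
    [9, 6, 0],
]
scores = bradley_terry(wins)
print([round(s, 2) for s in scores])
```

The returned scores are only defined up to an additive constant, so it is the differences between items that carry meaning.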

Comment on "No-Reference Video Quality Assessment Based on the Temporal Pooling of Deep Features"

Götz-Hahn

Hosu

Saupe

2020

Preprint

In Neural Processing Letters 50,3 (2019) a machine learning approach to blind video quality assessment was proposed [14]. It is based on temporal pooling of features of video frames, taken from the last pooling layer of deep convolutional neural networks. The method was validated on two established benchmark datasets and gave results far better than the previous state-of-the-art. In this letter we report the results from our careful reimplementations. The performance results, claimed in the paper, cannot be reached, and are even below the state-of-the-art by a large margin. We show that the originally reported wrong performance results are a consequence of two cases of data leakage. Information from outside the training dataset was used in the fine-tuning stage and in the model evaluation.

KonVid-150k: A Dataset for No-Reference Video Quality Assessment of Videos in-the-Wild

Götz-Hahn

Hosu

Saupe

2019

Preprint
