2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2017
DOI: 10.1109/icassp.2017.7953129
|View full text |Cite
|
Sign up to set email alerts
|

Machine learning based non-intrusive quality estimation with an augmented feature set

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
64
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 12 publications
(65 citation statements)
references
References 13 publications
1
64
0
Order By: Relevance
“…Haemin et al [4] proposed a deep neural network (DNN) based non-intrusive speech quality estimation method in real-time voice communication systems. Hakami and Kleijn [5] used augmented feature set and the neural network to improve the prediction accuracy of the single-ended quality assessment approach. Quality-Net [6], based on bidirectional long short term memory (BLSTM), combined the frame-level scores to the final estimated utterance-level quality score using average pooling method.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Haemin et al [4] proposed a deep neural network (DNN) based non-intrusive speech quality estimation method in real-time voice communication systems. Hakami and Kleijn [5] used augmented feature set and the neural network to improve the prediction accuracy of the single-ended quality assessment approach. Quality-Net [6], based on bidirectional long short term memory (BLSTM), combined the frame-level scores to the final estimated utterance-level quality score using average pooling method.…”
Section: Related Workmentioning
confidence: 99%
“…It was proposed relatively early and its accuracy is far from intrusive methods. With the rapid development of *Correspondence: wangjing@bit.edu.cn 1 School of Information and Electronics, Beijing Institute of Technology, Beijing, China Full list of author information is available at the end of the article deep learning technology, many researchers have applied deep neural networks to speech quality assessment [4][5][6][7], which greatly improved the accuracy of non-intrusive methods. But none of them paid attention to the pooling function before the output of neural networks in speech quality assessment task.…”
Section: Introductionmentioning
confidence: 99%
“…The underlying random features in X are independent. Without loss of generality, we assume R X = I d×d (so it sets the scale), R U = hI d×d , and R W = gI t×t , where d and t are the dimensionality of X and Y respectively, and g and h are small [1]. These assumptions led to…”
Section: Model Behaviour For Redundant Featuresmentioning
confidence: 99%
“…As proposed in the previous section, using a large number of features is beneficial for better performance. The usage of a large number of features naturally leads to the inclusion of features that have poor behaviour [1].…”
Section: Pre-processing Featuresmentioning
confidence: 99%
See 1 more Smart Citation