2021
DOI: 10.1186/s13636-021-00209-4
|View full text |Cite
|
Sign up to set email alerts
|

Neural network-based non-intrusive speech quality assessment using attention pooling function

Abstract: Recently, the non-intrusive speech quality assessment method has attracted a lot of attention since it does not require the original reference signals. At the same time, neural networks began to be applied to speech quality assessment and achieved good performance. To improve the performance of non-intrusive speech quality assessment, this paper proposes a neural network-based assessment method using attention pooling function. The proposed systems are based on the convolutional neural networks (CNNs), bidirec… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 23 publications
0
2
0
Order By: Relevance
“…Liu et al [25] has presented non-intrusive speech quality assessment depend on DNN for speech communication. Here, describes DL-depend strategy uses large-scale intrusive simulated data to increase accurateness, generalizability of non-intrusive techniques.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Liu et al [25] has presented non-intrusive speech quality assessment depend on DNN for speech communication. Here, describes DL-depend strategy uses large-scale intrusive simulated data to increase accurateness, generalizability of non-intrusive techniques.…”
Section: Literature Reviewmentioning
confidence: 99%
“…The work in [18]- [57] describes MLbased approaches to NR quality estimation. Some of these NR tools produce estimates of subjective test scores that report speech or sound quality mean opinion score (MOS) [18]- [20], [25]- [28], [31], [36], [40], [42], [43], [45], [49], [50], [57], naturalness [29], [35], [37], listening effort [24], noise intrusiveness [50], and speech intelligibility [21], [33]. The non-intrusive speech quality assessment model called NISQA [53] uses log-mel-spectrograms to produce estimates of subjective speech quality as well as four constituent dimensions: noisiness, coloration, discontinuity, and loudness.…”
Section: A Existing Machine Learning Approachesmentioning
confidence: 99%
“…As machine learning (ML) has become more powerful and accessible, numerous research groups have sought to apply ML to develop NR tools [17]- [50]. Some of these NR tools produce estimates of subjective test scores that report speech or sound quality mean opinion score (MOS) [17]- [19], [24]- [27], [30], [35], [38], [40], [41], [46], [47], naturalness [28], [34], [36], listening effort [23], noise intrusiveness [47], and speech intelligibility [20], [32]. The non-intrusive speech quality assessment model called NISQA [50] produces estimates of subjective speech quality as well as four constituent dimensions: noisiness, coloration, discontinuity, and loudness.…”
Section: A Existing Machine Learning Approachesmentioning
confidence: 99%