2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) 2017
DOI: 10.1109/apsipa.2017.8282110
|View full text |Cite
|
Sign up to set email alerts
|

Perceptual evaluation of singing quality

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
48
0
1

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 28 publications
(50 citation statements)
references
References 16 publications
1
48
0
1
Order By: Relevance
“…where σ S and σ T are the standard deviations of signals S and T , respectively. Besides PCC, we also examine the Frame Disturbance between the converted prosody and the reference [41,42] . We first perform dynamic programming (DTW) to obtain the frame alignment between the original target and converted F0 contour, and calculate the number [25] and PSR [26]) and the traditional linear F0 conversion.…”
Section: Methodsmentioning
confidence: 99%
“…where σ S and σ T are the standard deviations of signals S and T , respectively. Besides PCC, we also examine the Frame Disturbance between the converted prosody and the reference [41,42] . We first perform dynamic programming (DTW) to obtain the frame alignment between the original target and converted F0 contour, and calculate the number [25] and PSR [26]) and the traditional linear F0 conversion.…”
Section: Methodsmentioning
confidence: 99%
“…That is, a higher weighting is used for localized distortions in PESQ pesnq apsipa transactions score computation. Motivated by this approach, we applied this concept of audio quality perception for singing quality assessment in our previous work [8] to obtain a novel PESQ-like singing quality score. PESQ combines the frame-level disturbance values of a degraded audio with respect to the original audio by computing the L 6 norm over split-second intervals, i.e.…”
Section: ) Cognitive Modeling: Localized Versus Distributed Errorsmentioning
confidence: 99%
“…The value of p in L p norm is higher for averaging over split-second intervals, to give more weight to localized disturbances than distributed disturbances. In our previous work [8], we applied the same idea of L 6 and L 2 norm to the frame disturbances computed from the dynamic time warping (DTW) optimal path deviation from the diagonal, for a test singing with respect to the reference singing. We applied it to different pitch and rhythm acoustic features, as will be discussed in Section III.4.…”
Section: ) Cognitive Modeling: Localized Versus Distributed Errorsmentioning
confidence: 99%
See 2 more Smart Citations