2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2018
DOI: 10.1109/icassp.2018.8462635
|View full text |Cite
|
Sign up to set email alerts
|

Unsupervised Discovery of an Extended Phoneme Set in L2 English Speech for Mispronunciation Detection and Diagnosis

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
11
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
2
1

Relationship

2
5

Authors

Journals

citations
Cited by 12 publications
(11 citation statements)
references
References 15 publications
0
11
0
Order By: Relevance
“…Non-categorical patterns acquisition framework proaches [9][10][11][12][13] proposed to capture the L2 pronunciation deviations from categorical phones. [9,12,13] try to capture each L2 error patterns of a given categorical phone, while [11] focuses on extending a native phone set to include the noncategorical patterns in L2 English speech. Actually, this work is an extension of [11].…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…Non-categorical patterns acquisition framework proaches [9][10][11][12][13] proposed to capture the L2 pronunciation deviations from categorical phones. [9,12,13] try to capture each L2 error patterns of a given categorical phone, while [11] focuses on extending a native phone set to include the noncategorical patterns in L2 English speech. Actually, this work is an extension of [11].…”
Section: Introductionmentioning
confidence: 99%
“…[9,12,13] try to capture each L2 error patterns of a given categorical phone, while [11] focuses on extending a native phone set to include the noncategorical patterns in L2 English speech. Actually, this work is an extension of [11]. The distinction lies in 1) this approach extracts segment-level features to model the phonetic information while [11] uses non-segmental frame-level features and their average, thereby the accumulated errors are reduced and the discovered non-categories are more accurate with higher confusion degree; 2) this work uses a more simple but effective method to explore non-categories, while not involving k-means clustering method used in [11].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…To solve this problem, our previous work [18] discovered an Extended Phoneme Set in L2 speech (L2-EPS) by using unsupervised clustering algorithm based on phoneme-based features. It gives a more complete description on pronunciation patterns in L2 speech, some of which are ignored by the native phoneme set.…”
Section: Introductionmentioning
confidence: 99%
“…However, there are still some phonetic patterns that cannot be captured well by that approach. Because the phoneme-based phonemic posterior-grams (PPGs) used as clustering features in [18] cannot provide the state information within a segment, which is also important in reflecting the phonetic patterns.…”
Section: Introductionmentioning
confidence: 99%