7th ISCA Workshop on Speech and Language Technology in Education (SLaTE 2017) 2017
DOI: 10.21437/slate.2017-12
|View full text |Cite
|
Sign up to set email alerts
|

Mispronunciation Diagnosis of L2 English at Articulatory Level Using Articulatory Goodness-Of-Pronunciation Features

Abstract: This paper proposes a method to provide an articulatory diagnosis of English produced by Korean learners using articulatory Goodness-Of-Pronunciation (aGOP) features, which are based on the distinctive feature theory in phonology. Previous studies on mispronunciation diagnosis have mainly dealt with pronunciation errors at phone-level. They inform learners of which phone is recognized as a diagnosis, when the corresponding segment is realized as a mispronunciation. However, to provide learners more effective c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
4
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(4 citation statements)
references
References 16 publications
(31 reference statements)
0
4
0
Order By: Relevance
“…Sudhakara et al [16] introduced context-aware GOP which takes both senone and transition state probabilities into consideration. Ryu et al [17] inferred that pronunciation scoring must combine phone level as well as articulatorylevel diagnoses such as voicing, place of articulation, and manner of articulation on consonants. Lin et al [18] used the acoustic model and replaced the forced alignment layer with a self-attention layer to get an utterance score based on transfer learning, but the results greatly depend on fine-tuning the scores of datasets.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Sudhakara et al [16] introduced context-aware GOP which takes both senone and transition state probabilities into consideration. Ryu et al [17] inferred that pronunciation scoring must combine phone level as well as articulatorylevel diagnoses such as voicing, place of articulation, and manner of articulation on consonants. Lin et al [18] used the acoustic model and replaced the forced alignment layer with a self-attention layer to get an utterance score based on transfer learning, but the results greatly depend on fine-tuning the scores of datasets.…”
Section: Related Workmentioning
confidence: 99%
“…The former approaches discussed are based only on the pronunciation scoring based on the likelihood of individual phones in sequential order, hence is limited to phone-level features extraction of the audio file to test [13][14][15][16][17][18][19]. The latter checks for a minimum distance for comparing different length time series or MFCC or LPC for audio comparison [20], [21], [22].…”
Section: Related Workmentioning
confidence: 99%
“…Another application of phones recognition and articulatory features estimation is Computer-Assisted Pronunciation Training. Some of the approaches are described in [8] and [9]. This paper dwells on applications of attention-based models to articulatory features detection.…”
Section: Introductionmentioning
confidence: 99%
“…Another application of phones recognition and articulatory features estimation is Computer-Assisted Pronunciation Training. Some of the approaches are described in [8] and [9].…”
Section: Introductionmentioning
confidence: 99%