Interspeech 2016 2016
DOI: 10.21437/interspeech.2016-517
|View full text |Cite
|
Sign up to set email alerts
|

Detecting Mispronunciations of L2 Learners and Providing Corrective Feedback Using Knowledge-Guided and Data-Driven Decision Trees

Abstract: We propose a novel decision tree based framework to detect phonetic mispronunciations produced by L2 learners caused by using inaccurate speech attributes, such as manner and place of articulation. Compared with conventional score-based CAPT (computer assisted pronunciation training) systems, our proposed framework has three advantages: (1) each mispronunciation in a tree can be interpreted and communicated to the L2 learners by traversing the corresponding path from a leaf node to the root node; (2) correctiv… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
14
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 17 publications
(14 citation statements)
references
References 21 publications
0
14
0
Order By: Relevance
“…To provide those situations of oral interaction technically to learners, dialogue-based CALL (Computer Aided Language Learning) systems have been developed [1,2,3], where not only pronunciation errors but also grammatical errors can be detected and their corrective feedback is also provided. To assess learners' pronunciation, native speakers' acoustic models are often referred to and comparison is made between learners' speech and its corresponding native model.…”
Section: Background and Objectivementioning
confidence: 99%
“…To provide those situations of oral interaction technically to learners, dialogue-based CALL (Computer Aided Language Learning) systems have been developed [1,2,3], where not only pronunciation errors but also grammatical errors can be detected and their corrective feedback is also provided. To assess learners' pronunciation, native speakers' acoustic models are often referred to and comparison is made between learners' speech and its corresponding native model.…”
Section: Background and Objectivementioning
confidence: 99%
“…As can be seen in Section 4, the number of the correctly pronounced phone is much larger than the number of mispronounced, which could make trained models biased. To prevent the bias problem, we adopt other phones' correctly pronounced observations as mispro-nounced samples of the target phone as much as the difference between the number of correct instances and the number of incorrect instances to make a balance [5].…”
Section: Methodsmentioning
confidence: 99%
“…There have been several studies to detect pronunciation errors of learners [2][3][4] [5]. The study of [2] suggested an extended recognition network (ERN), which expands pronunciation dictionaries of learners by predicting frequent erroneous pronunciation sequences.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Faced with the challenges of inconsistency in non-native phone-based labeling and imperfect acoustic modeling, our previous work [18,19] has investigated articulatory-based modeling for CAPT, where speech attributes [20,21] {lee.wei, chl}@gatech.edu, marco.siniscalchi@unikore.it, nfychen@i2r.a-star.edu.sg…”
Section: Introductionmentioning
confidence: 99%