2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2016
DOI: 10.1109/icassp.2016.7472703
|View full text |Cite
|
Sign up to set email alerts
|

Landmark of Mandarin nasal codas and its application in pronunciation error detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 11 publications
(6 citation statements)
references
References 14 publications
0
6
0
Order By: Relevance
“…Moreover, current attribute-level pronunciation score is just calculated using simple summation of frame-level posteriors. In the future, advanced method such as dynamic programing [35] and articulatory landmark mechanism [36], will also be studied. Finally, due to the limited page size, this preliminary work only reports the average mispronunciation detection performance and analyzes only a few trees in Figures 2-4, more individual phone-dependent detection performance will be analyzed in the near future with a full report.…”
Section: Resultsmentioning
confidence: 99%
“…Moreover, current attribute-level pronunciation score is just calculated using simple summation of frame-level posteriors. In the future, advanced method such as dynamic programing [35] and articulatory landmark mechanism [36], will also be studied. Finally, due to the limited page size, this preliminary work only reports the average mispronunciation detection performance and analyzes only a few trees in Figures 2-4, more individual phone-dependent detection performance will be analyzed in the near future with a full report.…”
Section: Resultsmentioning
confidence: 99%
“…Considering the purpose of CAPT, DA and FRR are more important in measuring the detection performance than FAR [2,27]. The evaluation models and the decision threshold were optimized by aiming at maximizing DA.…”
Section: Figure 2 : Roc Curves Of Evaluations On Held-out Test Set Fomentioning
confidence: 99%
“…Indirect measurement models with CNN have received extensive attention over the last decades; Anjit and Rishidas [ 29 ] show the facial recognition model composing with two-layer deep CNN for feature extraction and SRC for classification. A variety of machine learning techniques have been used to predict nasal problems, such as the back-propagation neural network (BPNN) for the identification of the nasal bone [ 30 ], the long short-term memory neural network (LSTM) in SAHS event detection based on breathing [ 31 ], the random forest (RF) [ 32 ], and the support vector machine (SVM) [ 33 ]. Hybridization approaches integrating different machine learning techniques [ 34 ] have also been explored in recent years to improve nasal bone morphological prediction reliability and accuracy, with satisfactory morphology results.…”
Section: Introductionmentioning
confidence: 99%