2021
DOI: 10.48550/arxiv.2107.03675
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Multilingual Speech Evaluation: Case Studies on English, Malay and Tamil

Abstract: Speech evaluation is an essential component in computerassisted language learning (CALL). While speech evaluation on English has been popular, automatic speech scoring on low resource languages remains challenging. Work in this area has focused on monolingual specific designs and handcrafted features stemming from resource-rich languages like English. Such approaches are often difficult to generalize to other languages, especially if we also want to consider suprasegmental qualities such as rhythm. In this wor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 31 publications
(36 reference statements)
0
1
0
Order By: Relevance
“…In the features based approaches, various types of fluency related features, e.g. statistics of speech breaks [6][7][8], speech rate [6][7][8][9][10], filled pauses [7] and goodness of pronunciation (GOP) [11,12], are computed from time stamps provided for input speech, e.g., beginning and end time of words, phonemes. Different machine learning models, e.g., support vector machine (SVM) [7,13] and feed-forward neural networks [8], were trained to predict fluency scores from the extracted features.…”
Section: Introductionmentioning
confidence: 99%
“…In the features based approaches, various types of fluency related features, e.g. statistics of speech breaks [6][7][8], speech rate [6][7][8][9][10], filled pauses [7] and goodness of pronunciation (GOP) [11,12], are computed from time stamps provided for input speech, e.g., beginning and end time of words, phonemes. Different machine learning models, e.g., support vector machine (SVM) [7,13] and feed-forward neural networks [8], were trained to predict fluency scores from the extracted features.…”
Section: Introductionmentioning
confidence: 99%