Shaojun Gao scite author profile

Shaojun Gao

2Publications

5Citation Statements Received

57Citation Statements Given

How they've been cited

How they cite others

Affiliations

Publications

Order By: Most citations

Using Fluency Representation Learned from Sequential Raw Features for Improving Non-native Fluency Scoring

Fu¹,

Gao²,

Tian³

et al. 2022

View full text Add to dashboard Cite

Speech fluency/disfluency can be evaluated by analyzing a range of phonetic and prosodic features. Deep neural networks are commonly trained to map fluency-related features into the human scores. However, the effectiveness of deep learning-based models is constrained by the limited amount of labeled training samples. To address this, we introduce a self-supervised learning (SSL) approach that takes into account phonetic and prosody awareness for fluency scoring. Specifically, we first pre-train the model using a reconstruction loss function, by masking phones and their durations jointly on a large amount of unlabeled speech and text prompts. We then fine-tune the pre-trained model using human-annotated scoring data. Our experimental results, conducted on datasets such as Speechocean762 and our non-native datasets, show that our proposed method outperforms the baseline systems in terms of Pearson correlation coefficients (PCC). Moreover, we also conduct an ablation study to better understand the contribution of phonetic and prosody factors during the pre-training stage.

show abstract

Phonetic and Prosody-aware Self-supervised Learning Approach for Non-native Fluency Scoring

Fu¹,

Gao²,

Shi³

et al. 2023

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Shaojun Gao

Using Fluency Representation Learned from Sequential Raw Features for Improving Non-native Fluency Scoring

Phonetic and Prosody-aware Self-supervised Learning Approach for Non-native Fluency Scoring

Contact Info

Product

Resources

About