Shuju Shi scite author profile

Shuju Shi

5Publications

31Citation Statements Received

50Citation Statements Given

How they've been cited

How they cite others

Affiliations

University of Illinois Urbana-Champaign, Beijing Language and Culture University

Publications

Order By: Most citations

Automatic Assessment and Error Detection of Shadowing Speech: Case of English Spoken by Japanese Learners

Shi¹,

Kashiwagi

Toyama³

et al. 2016

View full text Add to dashboard Cite

Shadowing is a task where the subject is required to repeat the presented speech as s/he hears it. Although shadowing is cognitively a challenging task, it is considered as an efficient way of language training since it includes processes of listening, speaking and comprehension simultaneously. Our previous study realized automatic assessment of shadowing speech using the average of Goodness of Pronunciation (GOP) scores. But the fact that shadowing often includes broken utterances makes this approach insufficient. This study attempts to improve automatic assessment and, at the same time, give corrective feedbacks to learners based on error detection. We first manually labeled shadowing speech of 10 female and 10 male speakers and defined ten typical error types including word omission, substitution etc.. Forced alignment with adjusted grammar and GOP scores are adopted to detect word omission errors and poorly pronounced words. In the experiments, GOP scores, Word Recognition Rate (WRR), silence ratio, forced alignment log-likelihood scores, word omission rate are used to predict the overall proficiency of the individual speakers. The mean correlation coefficient between automatic scores and the speaker's TOEIC scores is 0.81, improved by 13% relatively. The detection accuracy of word omission is 73%.

show abstract

How native speakers of Japanese try to sound polite

Tsurutani

Shi

2018

View full text Add to dashboard Cite

This study examines phonetic cues used to express politeness in spoken Japanese. The tasks of producing polite and non-polite speech in two different types of sentences (a question and a polite imperative) and in attitudinal speech (a request and a decline) were used to examine various F0 and temporal aspects of polite speech. Eight sentences spoken by 18 native speakers were acoustically measured at both sentence level and sentence final mora level. It was found that Japanese native speakers generally use a slower speech rate and a breathy voice for polite speech, but not necessarily a high pitched voice or wider pitch range, even in the case of female speakers. The use of pitch was found to be attitude dependent, but was not affected by the sentence type. Clear gender differences were seen in various phonetic aspects. Some politeness strategies observed at individual level are also reported.

show abstract

An ASR-free Fluency Scoring Approach with Self-Supervised Learning

Liu¹,

Fu²,

Tian³

et al. 2023

Preprint

View full text Add to dashboard Cite

A typical fluency scoring system generally relies on an automatic speech recognition (ASR) system to obtain time stamps in input speech for either the subsequent calculation of fluency-related features or directly modeling speech fluency with an end-to-end approach. This paper describes a novel ASR-free approach for automatic fluency assessment using self-supervised learning (SSL). Specifically, wav2vec2.0 is used to extract frame-level speech features, followed by K-means clustering to assign a pseudo label (cluster index) to each frame. A BLSTM-based model is trained to predict an utterance-level fluency score from frame-level SSL features and the corresponding cluster indexes. Neither speech transcription nor time stamp information is required in the proposed system. It is ASR-free and can potentially avoid the ASR errors effect in practice. Experimental results carried out on non-native English databases show that the proposed approach significantly improves the performance in the "open response" scenario as compared to previous methods and matches the recently reported performance in the "read aloud" scenario.

show abstract

Cross-language comparison of F0 range in speakers of native Chinese, native Japanese and Chinese L2 of Japanese: Preliminary results of a corpus-based analysis

Shi

Zhang

Xie

2014

View full text Add to dashboard Cite

Modeling Clinical Empathy in Narrative Essays

Shi

Sun

Zavala

et al. 2021

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Shuju Shi

Automatic Assessment and Error Detection of Shadowing Speech: Case of English Spoken by Japanese Learners

How native speakers of Japanese try to sound polite

An ASR-free Fluency Scoring Approach with Self-Supervised Learning

Cross-language comparison of F0 range in speakers of native Chinese, native Japanese and Chinese L2 of Japanese: Preliminary results of a corpus-based analysis

Modeling Clinical Empathy in Narrative Essays

Contact Info

Product

Resources

About