Proceedings of the Ninth Workshop on Innovative Use of NLP for Building Educational Applications 2014
DOI: 10.3115/v1/w14-1802

Automatic Assessment of the Speech of Young English Learners

Abstract: This paper introduces some of the research behind automatic scoring of the speaking part of the Arizona English Language Learner Assessment, a large-scale test now operational for students in Arizona. Approximately 70% of the students tested are in the range 4-11 years old. We cover the methods used to assess spoken responses automatically, considering both what the student says and the way in which the student speaks. We also provide evidence for the validity of machine scores. The assessments include 10 open…

Cited by 22 publications (22 citation statements)
References 21 publications
“…The task types involved in this study are listed in Table 1. More details can be found in (Bernstein et al., 2010b; Cheng et al., 2014; Cheng and Shen, 2010; Xu et al., 2012). Item types SentReading, SentRepeats, SentBuild, and PassReading are constrained: there is a set of pre-defined correct words and word sequences that test takers are expected to include in their response, although language models accept miscues and popular mistakes.…”
Section: Datasets
confidence: 94%
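The constrained item types quoted above can be scored by aligning the recognized words against the pre-defined expected word sequence. As a minimal sketch (not the operational system's method), the fraction of expected words recovered can be computed with a longest-common-subsequence-style alignment, so miscues and insertions reduce the score without zeroing it out; `constrained_item_score` is a hypothetical helper name:

```python
from difflib import SequenceMatcher

def constrained_item_score(expected, recognized):
    """Fraction of the expected word sequence matched by the recognized words.

    Alignment via difflib's longest-matching-block algorithm, so insertions
    and substitutions in the response lower the score proportionally.
    """
    matcher = SequenceMatcher(a=expected, b=recognized)
    matched = sum(block.size for block in matcher.get_matching_blocks())
    return matched / len(expected) if expected else 0.0

# Example: a sentence-repeat item with one filler insertion and one miscue.
expected = "the quick brown fox jumps".split()
recognized = "the quick uh brown fox jump".split()
score = constrained_item_score(expected, recognized)  # 4 of 5 expected words matched
```

A production system would instead build the tolerance for miscues and popular mistakes directly into the recognizer's language model, as the quoted statement notes.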
“…In the last decade, the tasks used in automatic spoken assessment research have extended from constrained ones (such as reading or repeating a sentence, or reading a passage) to open-ended ones (such as open questions or retellings of a story, passage, picture or presentation) (Bernstein et al., 2010b; Cheng et al., 2014; Evanini and Wang, 2013; Nair et al., 2005; Pearson, 2009; Zechner et al., 2009a). Providing reasonable recognition performance on these open-ended tasks is a major challenge.…”
Section: Related Work
confidence: 98%
“…Cheng, D'Antilio, Chen, and Bernstein (2014) presented an automated speech scoring system for an Arizona K–12 language test for English‐language learners that contained a range of items, from predictable (read or repeated prompt) to more open ended. There were 11 different item types, of which 9 were open ended.…”
Section: Overview Of Item Types Used
confidence: 99%
“…Xie et al. (2012) explored content measures based on the lexical similarity between the response and a set of reference responses. A content-scoring component based on word vectors was also part of the automated scoring engine described by Cheng et al. (2014). In both these studies, content features were developed to supplement other features measuring various aspects of speaking proficiency.…”
Section: Related Work
confidence: 99%
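The word-vector content scoring mentioned in this citation statement can be illustrated with a simple bag-of-words sketch: score a response by its maximum cosine similarity to a set of reference responses. This is an assumption-laden toy version (plain word counts, no stemming or learned embeddings), not the actual engine described by Cheng et al. (2014); `content_score` is a hypothetical helper name:

```python
import math
from collections import Counter

def cosine(u, v):
    """Cosine similarity between two sparse word-count vectors (Counters)."""
    dot = sum(u[w] * v[w] for w in u if w in v)
    norm = math.sqrt(sum(c * c for c in u.values())) * math.sqrt(sum(c * c for c in v.values()))
    return dot / norm if norm else 0.0

def content_score(response, references):
    """Max cosine similarity of the response against a set of reference responses."""
    resp_vec = Counter(response.lower().split())
    return max(cosine(resp_vec, Counter(ref.lower().split())) for ref in references)

refs = ["the boy rode his bike to school",
        "a boy went to school on his bicycle"]
on_topic = content_score("the boy rode a bike to school", refs)   # high overlap with refs
off_topic = content_score("my favorite food is pizza", refs)      # no overlap with refs
```

In this toy setup the on-topic response scores well above the off-topic one; real systems typically refine the vectors (e.g., weighting or dimensionality reduction) and combine the content score with delivery features, as both cited studies did.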