2020
DOI: 10.1016/j.csl.2019.101052
|View full text |Cite
|
Sign up to set email alerts
|

Sequence labeling to detect stuttering events in read speech

Abstract: This is a PDF file of an article that has undergone enhancements after acceptance, such as the addition of a cover page and metadata, and formatting for readability, but it is not yet the definitive version of record. This version will undergo additional copyediting, typesetting and review before it is published in its final form, but we are providing this version to give early visibility of the article. Please note that, during the production process, errors may be discovered which could affect the content, a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
3
1

Relationship

0
10

Authors

Journals

citations
Cited by 18 publications
(8 citation statements)
references
References 29 publications
0
6
0
Order By: Relevance
“…Sadeen et. al [20] studied two approaches to do sequence labeling for stuttered speech, Conditional Random Fields (CRF) and Bi-directional Long Short Term Memory Networks (BI-LSTM). The study showed an improvement in F1 Score by 33.6% for the BiL-STM model relative to CRF baseline, using ngram based features.…”
Section: Related Workmentioning
confidence: 99%
“…Sadeen et. al [20] studied two approaches to do sequence labeling for stuttered speech, Conditional Random Fields (CRF) and Bi-directional Long Short Term Memory Networks (BI-LSTM). The study showed an improvement in F1 Score by 33.6% for the BiL-STM model relative to CRF baseline, using ngram based features.…”
Section: Related Workmentioning
confidence: 99%
“…Following the Bayesian statistical framework, this entails maximization of the a-posteriori probability of the word sequence with disfluencies, given the originally intended word sequence. Since disfluency events affect different phonetic aspects of speech [6] many researchers tried to take advantage of the possible combination of different sources of knowledge on both acoustic and language model sides [7,8,9,10]. The disfluency detection task in all mentioned works is solved by using sequence labelling/tagging approaches which can rely either on a generative approach such as Hidden Markov Model or discriminative log-linear models, such as Maximum Entropy Markov Model or Conditional Random Fields as well as Bidirectional Long Short-Term Memory based networks in combination with an attention mechanism [11].…”
Section: Prior Workmentioning
confidence: 99%
“…Stuttering identification, an interdisciplinary research problem in which a myriad number of research work (in-terms of acoustic feature extraction and classification methods) are currently going on with a focus on developing automatic tools for its detection and identification. Most of the existing work detect and identify stuttering either by language models [73,74] or by ASR systems [75,76], which first converts the audio signals into its corresponding textual form, and then by the application of language models, detects or identifies stuttering. This section provides in detail the comprehensive review of the various acoustic based feature extraction and machine learning stuttering identification techniques, that have been used in the literature.…”
Section: Statistical Approachesmentioning
confidence: 99%