Disfluency Detection with a Semi-Markov Model and Prosodic Features

Ferguson, James F.; Durrett, Greg; Klein, Dan

doi:10.3115/v1/n15-1029

Cited by 44 publications

(52 citation statements)

References 16 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…We compare our transition-based neural model to five top performing systems. Our model outperforms the state-of-the-art, achieving a 87.5% F- 91.6 82.3 86.7 Bi-LSTM (Zayats et al, 2016) 91.8 80.6 85.9 semi-CRF (Ferguson et al, 2015) 90.0 81.2 85.4 UBT (Wu et al, 2015) 90.3 80.5 85.1 M 3 N (Qian and Liu, 2013) --84.1 Table 5: Comparison with previous state-of-theart methods on the test set of English Switchboard.…”

Section: Performance On English Swtichboardmentioning

confidence: 98%

“…It achieves 2.4 point improvements over UBT (Wu et al, 2015), which is the best syntax-based method for disfluency detection. The best performance by linear statistical sequence labeling methods is the semi-CRF method of Ferguson et al (2015), achieving a 85.4% Fscore leveraging prosodic features. Our model obtains a 2.1 point improvement compared to this.…”

Section: Performance On English Swtichboardmentioning

confidence: 99%

“…The hand-crafted discrete features of CRF refer to those in Ferguson et al (2015). For the Bi-LSTM model, the token embedding is the same with our transition-based method.…”

Section: Performance On English Swtichboardmentioning

confidence: 99%

“…Another drawback of these approaches is that they are unable to exploit chunk-level features. Semi-CRF (Ferguson et al, 2015) is used to alleviate this issue to some extent. Semi-CRF models still have their inefficiencies because they can only use the local chunk information limited by the markov assumption when decoding.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Transition-Based Disfluency Detection using LSTMs

Wang¹,

Che²,

Zhang³

et al. 2017

Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing

View full text Add to dashboard Cite

We model the problem of disfluency detection using a transition-based framework, which incrementally constructs and labels the disfluency chunk of input sentences using a set of transition actions without syntax information. Compared with sequence labeling methods, it can capture non-local chunk-level features; compared with joint parsing and disfluency detection methods, it is free for noise in syntax. Experiments show that our model achieves state-of-theart F-score on both the commonly used English Switchboard test set and a set of in-house annotated Chinese data.

show abstract

Section: Performance On English Swtichboardmentioning

confidence: 98%

Section: Performance On English Swtichboardmentioning

confidence: 99%

“…The hand-crafted discrete features of CRF refer to those in Ferguson et al (2015). For the Bi-LSTM model, the token embedding is the same with our transition-based method.…”

Section: Performance On English Swtichboardmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Transition-Based Disfluency Detection using LSTMs

Wang¹,

Che²,

Zhang³

et al. 2017

Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing

View full text Add to dashboard Cite

show abstract

“…4 Qian and Liu (2013) 82.1 Honnibal and Johnson (2014) 84.1 Ferguson et al (2015) * 85.4 Zwarts and Johnson (2011) 85.7 Zayats et al (2016) * 85.9 LSTM-NCM 86.8 Table 6: Comparison of the LSTM-NCM to stateof-the-art methods on the dev set. *Models have used richer input.…”

Section: Discussionmentioning

confidence: 99%

Disfluency Detection using a Noisy Channel Model and a Deep Neural Language Model

Lou¹,

Johnson²

2017

Proceedings of the 55th Annual Meeting of the Association For Computational Linguistics (Volume 2: Short Papers)

View full text Add to dashboard Cite

This paper presents a model for disfluency detection in spontaneous speech transcripts called LSTM Noisy Channel Model. The model uses a Noisy Channel Model (NCM) to generate n-best candidate disfluency analyses and a Long Short-Term Memory (LSTM) language model to score the underlying fluent sentences of each analysis. The LSTM language model scores, along with other features, are used in a MaxEnt reranker to identify the most plausible analysis. We show that using an LSTM language model in the reranking process of noisy channel disfluency model improves the state-of-the-art in disfluency detection.

show abstract