Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Langua 2015
DOI: 10.3115/v1/n15-1029
|View full text |Cite
|
Sign up to set email alerts
|

Disfluency Detection with a Semi-Markov Model and Prosodic Features

Abstract: We present a discriminative model for detecting disfluencies in spoken language transcripts. Structurally, our model is a semi-Markov conditional random field with features targeting characteristics unique to speech repairs. This gives a significant performance improvement over standard chain-structured CRFs that have been employed in past work. We then incorporate prosodic features over silences and relative word duration into our semi-CRF model, resulting in further performance gains; moreover, these feature… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
46
0
6

Year Published

2016
2016
2022
2022

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 44 publications
(52 citation statements)
references
References 16 publications
0
46
0
6
Order By: Relevance
“…We compare our transition-based neural model to five top performing systems. Our model outperforms the state-of-the-art, achieving a 87.5% F- 91.6 82.3 86.7 Bi-LSTM (Zayats et al, 2016) 91.8 80.6 85.9 semi-CRF (Ferguson et al, 2015) 90.0 81.2 85.4 UBT (Wu et al, 2015) 90.3 80.5 85.1 M 3 N (Qian and Liu, 2013) --84.1 Table 5: Comparison with previous state-of-theart methods on the test set of English Switchboard.…”
Section: Performance On English Swtichboardmentioning
confidence: 98%
See 3 more Smart Citations
“…We compare our transition-based neural model to five top performing systems. Our model outperforms the state-of-the-art, achieving a 87.5% F- 91.6 82.3 86.7 Bi-LSTM (Zayats et al, 2016) 91.8 80.6 85.9 semi-CRF (Ferguson et al, 2015) 90.0 81.2 85.4 UBT (Wu et al, 2015) 90.3 80.5 85.1 M 3 N (Qian and Liu, 2013) --84.1 Table 5: Comparison with previous state-of-theart methods on the test set of English Switchboard.…”
Section: Performance On English Swtichboardmentioning
confidence: 98%
“…It achieves 2.4 point improvements over UBT (Wu et al, 2015), which is the best syntax-based method for disfluency detection. The best performance by linear statistical sequence labeling methods is the semi-CRF method of Ferguson et al (2015), achieving a 85.4% Fscore leveraging prosodic features. Our model obtains a 2.1 point improvement compared to this.…”
Section: Performance On English Swtichboardmentioning
confidence: 99%
See 2 more Smart Citations
“…4 Qian and Liu (2013) 82.1 Honnibal and Johnson (2014) 84.1 Ferguson et al (2015) * 85.4 Zwarts and Johnson (2011) 85.7 Zayats et al (2016) * 85.9 LSTM-NCM 86.8 Table 6: Comparison of the LSTM-NCM to stateof-the-art methods on the dev set. *Models have used richer input.…”
Section: Discussionmentioning
confidence: 99%