2009 International Forum on Information Technology and Applications 2009
DOI: 10.1109/ifita.2009.381
|View full text |Cite
|
Sign up to set email alerts
|

A Speech Endpoint Detection Based on Dynamically Updated Threshold of Box-Counting Dimension

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2010
2010
2023
2023

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 4 publications
0
2
0
Order By: Relevance
“…The VAD block is used to detect the beginning and end of speech waveforms and to exclude nonspeech segments [3][4][5][6][7]. This technique is used in this work because nonspeech segments can degrade recognition performance, especially at a low signal-to-noise ratio (SNR) [3][4][5].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The VAD block is used to detect the beginning and end of speech waveforms and to exclude nonspeech segments [3][4][5][6][7]. This technique is used in this work because nonspeech segments can degrade recognition performance, especially at a low signal-to-noise ratio (SNR) [3][4][5].…”
Section: Introductionmentioning
confidence: 99%
“…This technique is used in this work because nonspeech segments can degrade recognition performance, especially at a low signal-to-noise ratio (SNR) [3][4][5]. In noisy environments, the noise will smear speech waveforms, thus a robust speech recognition algorithm, such as cepstrum mean subtraction (CMS) [10], running spectrum filtering (RSF), and dynamic range adjustment (DRA) [8][9][10][11], is required.…”
Section: Introductionmentioning
confidence: 99%