2002
DOI: 10.1109/89.985548
|View full text |Cite
|
Sign up to set email alerts
|

Speech pause detection for noise spectrum estimation by tracking power envelope dynamics

Abstract: A speech pause detection algorithm is an important and sensitive part of most single-microphone noise reduction schemes for enhancement of speech signals corrupted by additive noise as an estimate of the background noise is usually determined when speech is absent. An algorithm is proposed which detects speech pauses by adaptively tracking minima in a noisy signal's power envelope both for the broadband signal and for the high-pass and low-pass filtered signal. In poor signal-to-noise ratios (SNRs), the propos… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
119
0
8

Year Published

2006
2006
2013
2013

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 156 publications
(129 citation statements)
references
References 18 publications
(15 reference statements)
2
119
0
8
Order By: Relevance
“…The working points of the ITU-T G.729, ETSI AMR and ETSI AFE VADs are also included. The results show improvements in speech detection accuracy over standard VADs and a representative set of recently reported VAD algorithms [20,21,19,6].…”
Section: Methodsmentioning
confidence: 70%
See 2 more Smart Citations
“…The working points of the ITU-T G.729, ETSI AMR and ETSI AFE VADs are also included. The results show improvements in speech detection accuracy over standard VADs and a representative set of recently reported VAD algorithms [20,21,19,6].…”
Section: Methodsmentioning
confidence: 70%
“…The non-speech hit rate (HR0) and the false alarm rate (FAR0= 100-HR1) were determined for each noisy condition being the actual speech frames and actual speech pauses determined by hand-labelling the database on the close-talking microphone. Figure 5 shows the ROC curves of the proposed VAD and other frequently referred algorithms [20,21,19,6] for recordings from the distant microphone in quiet and high noisy conditions. The working points of the ITU-T G.729, ETSI AMR and ETSI AFE VADs are also included.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…If the input speech produced by the alike speaker or similar environmental condition from the reference patterns, the system is likely to find the correct pattern otherwise not [13], [14]. The few valuable approaches used for feature extraction are: full-band and subband energies [15], spectrum divergence between speech signal and noise in background [16], pitch estimation [17], zero crossing rate [18], and higher-order statistics [19], [20], [21], [22]. However, using longterm speech information [23], [24] has shown considerable benefits for identifying speech signal in noisy environments.…”
Section: Speech Recognitionmentioning
confidence: 99%
“…During the last decade, numerous researchers have developed different strategies for detecting speech on a noisy signal (Sohn et al, 1999), (Cho & Kondoz 2001) and have evaluated the influence of the VAD effectiveness on the performance of speech processing systems (Bouquin-Jeannes & Faucon, 1995) (see also the preceding chapter about VAD). Most of them have focussed on the development of robust algorithms with special attention on the derivation and study of noise robust features and decision rules (Woo et al, 2000), (Li et al, 2002), (Marzinzik & Kollmeier, 2002), (Sohn et al, 1999). The different approaches include those based on energy thresholds (Woo et al, 2000), pitch detection (Chengalvarayan, 1999), spectrum analysis (Marzinzik & Kollmeier, 2002), zero-crossing rate (ITU, 1996), periodicity measures (Tucker, 1992) or combinations of different features (ITU, 1996), (ETSI, 1999).…”
Section: Introductionmentioning
confidence: 99%