2005
DOI: 10.1109/tsa.2005.853005
|View full text |Cite
|
Sign up to set email alerts
|

Processing of reverberant speech for time-delay estimation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
37
0
1

Year Published

2007
2007
2013
2013

Publication Types

Select...
4
4
1

Relationship

1
8

Authors

Journals

citations
Cited by 70 publications
(38 citation statements)
references
References 18 publications
0
37
0
1
Order By: Relevance
“…The epochs can be used as pitch markers for prosody manipulation, which is useful in applications like text-to-speech synthesis, voice conversion and speech rate conversion [3], [4]. Knowledge of the epoch locations may be used for estimating the time-delay between speech signals collected over a pair of spatially distributed microphones [5]. The segmental signal-to-noise ratio (SNR) of the speech signal is high in the regions around epochs, and hence, it is possible to enhance the speech by exploiting the characteristics of speech signals around the epochs [6].…”
Section: A Significance Of Epochs In Speech Analysismentioning
confidence: 99%
“…The epochs can be used as pitch markers for prosody manipulation, which is useful in applications like text-to-speech synthesis, voice conversion and speech rate conversion [3], [4]. Knowledge of the epoch locations may be used for estimating the time-delay between speech signals collected over a pair of spatially distributed microphones [5]. The segmental signal-to-noise ratio (SNR) of the speech signal is high in the regions around epochs, and hence, it is possible to enhance the speech by exploiting the characteristics of speech signals around the epochs [6].…”
Section: A Significance Of Epochs In Speech Analysismentioning
confidence: 99%
“…The step-sizes have been chosen such that all algorithms reach same asymptotic NPM. As before, true delays for direct-paths have been employed for ext-NMCFLMS [8] while we have employed GCC with PHAT prefilter of the Hilbert envelope of LP residual of speech [14] to estimate TDOA of direct-paths for ext-NMCFLMSDPE. After initial convergence, NMCFLMS and ext-NMCFLMS misconverge whereas ext-NMCFLMSDPE avoids misconvergence.…”
Section: Simulation Resultsmentioning
confidence: 99%
“…For reverberant speech, an effective method has been proposed in [14] which performs GCC on the Hilbert envelope of linear prediction (LP) residual of input speech.…”
Section: The Gcc Algorithmmentioning
confidence: 99%
“…A class of temporal processing methods have been proposed by exploiting the excitation source characteristics of the speech signal for the enhancement (Yegnanarayana et al 1999;Yegnanarayana & Satyanarayana Murthy 2000;Yegnanarayana et al 2003Yegnanarayana et al , 2005. Linear prediction (LP) residual obtained by inverse filtering the speech is used as an estimate of the source of excitation of the vocal tract system (Yegnanarayana et al 1999).…”
Section: Motivation For the Combined Temporal And Spectral Processingmentioning
confidence: 99%