1997 IEEE International Conference on Acoustics, Speech, and Signal Processing
DOI: 10.1109/icassp.1997.599651
|View full text |Cite
|
Sign up to set email alerts
|

A robust method for speech signal time-delay estimation in reverberant rooms

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
211
0
2

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 303 publications
(225 citation statements)
references
References 9 publications
0
211
0
2
Order By: Relevance
“…The performance of the GCC approach is dependent on the design of the prefilters whose main function is to improve the accuracy of peak detection. We employed the phase transform (PHAT) prefilter which has shown to give satisfactory TDOA estimates in the presence of reverberation [13]. The estimated time delay between the ith and jth channel is given by τ = arg max…”
Section: The Gcc Algorithmmentioning
confidence: 99%
“…The performance of the GCC approach is dependent on the design of the prefilters whose main function is to improve the accuracy of peak detection. We employed the phase transform (PHAT) prefilter which has shown to give satisfactory TDOA estimates in the presence of reverberation [13]. The estimated time delay between the ith and jth channel is given by τ = arg max…”
Section: The Gcc Algorithmmentioning
confidence: 99%
“…2 because in speaker diarization we do not know the number of speakers or their locations. Therefore we use a modified version of the Generalized Cross Correlation (GCC) called "generalized cross correlation with phase transform" (GCC-PHAT) (see [7]). …”
Section: Tdoa Estimation Via Gcc-phatmentioning
confidence: 99%
“…To obtain the Time Delay of Arrival (TDOA) for each segment of a channel, the GCC-PHAT ( [7]) is computed between the segment and the corresponding segment in the reference channel. Such a measure is more robust and accurate than cross-correlation when the noise level is low and it outputs values normalized from 0 to 1.…”
Section: Robust Tdoa Estimationmentioning
confidence: 99%
See 1 more Smart Citation
“…We used a modified version of the Generalized Cross Correlation with phase transform (GCC PHAT (f)) (see [11]) and estimate the delays between microphones with the following formula:…”
Section: Introductionmentioning
confidence: 99%