A block least squares approach to acoustic echo cancellation

Woudenberg, E.; Soong, Frank K.; Juang, Biing‐Hwang

doi:10.1109/icassp.1999.759809

Cited by 5 publications

(4 citation statements)

References 5 publications

“…Crosscorrelation analysis suggests a different approach to the problem of crosstalk, namely, estimating the coupling between different channels and using the estimates to cancel the crosstalk signals. We are investigating such an approach based on the Block Least Squares algorithm described in [7]. However, the situation is complicated by the very rapid changes in coupling that occur when speakers or listeners move their heads.…”

Section: Discussion (mentioning)

confidence: 99%

Multispeaker speech activity detection for the ICSI meeting recorder

Pfau

Ellis

Stolcke

IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU '01.

As part of a project into speech recognition in meeting environments, we have collected a corpus of multichannel meeting recordings. We expected the identification of speaker activity to be straightforward given that the participants had individual microphones, but simple approaches yielded unacceptably erroneous labelings, mainly due to crosstalk between nearby speakers and wide variations in channel characteristics. Therefore, we have developed a more sophisticated approach for multichannel speech activity detection using a simple hidden Markov model (HMM). A baseline HMM speech activity detector has been extended to use mixtures of Gaussians to achieve robustness for different speakers under different conditions. Feature normalization and crosscorrelation processing are used to increase the channel independence and to detect crosstalk. The use of both energy normalization and crosscorrelation based postprocessing results in a 35% relative reduction of the frame error rate. Speech recognition experiments show that it is beneficial in this multispeaker setting to use the output of the speech activity detector for presegmenting the recognizer input, achieving word error rates within 10% of those achieved with manual turn labeling.

“…If basis functions are complex harmonics, the echo path is modeled as a FIR filter with Nϕ filter taps, R is the auto-correlation matrix of the excitation signal, and ξ is the cross-correlation vector of the excitation and microphone signals. This case corresponds to the block LS approach adopted in [3]. Unfortunately, for harmonic basis functions, the matrix R in general is not sparse, which makes solving the system (7) with large Nϕ a complicated problem.…”

Section: Spectral-domain Spline-identification (mentioning)

confidence: 99%
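The normal-equation formulation in the statement above can be illustrated with a minimal sketch. It assumes a real-valued FIR echo path, a single noise-free block of samples, and plain Python lists. The names `block_ls_fir` and `solve_linear` are hypothetical and are not taken from the cited papers. R is formed as the sample auto-correlation matrix of the excitation, ξ as the sample cross-correlation vector of the excitation and microphone signals, and the taps are obtained by solving R h = ξ.

```python
def solve_linear(A, b):
    """Solve A h = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix [A | b], copied so the inputs are not modified.
    M = [[float(v) for v in row] + [float(rhs)] for row, rhs in zip(A, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for row in range(col + 1, n):
            factor = M[row][col] / M[col][col]
            for c in range(col, n + 1):
                M[row][c] -= factor * M[col][c]
    # Back substitution.
    h = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(M[i][j] * h[j] for j in range(i + 1, n))
        h[i] = (M[i][n] - tail) / M[i][i]
    return h


def block_ls_fir(x, y, n_taps):
    """Estimate n_taps FIR coefficients from one block of excitation x
    and microphone signal y by solving the normal equations R h = xi."""
    n = len(x)
    # Each row holds the excitation samples that one output sample depends on:
    # [x[t], x[t-1], ..., x[t-n_taps+1]].
    rows = [[x[t - k] for k in range(n_taps)] for t in range(n_taps - 1, n)]
    targets = y[n_taps - 1:n]
    # R: sample auto-correlation matrix of the excitation.
    R = [[sum(r[i] * r[j] for r in rows) for j in range(n_taps)]
         for i in range(n_taps)]
    # xi: sample cross-correlation vector of excitation and microphone signal.
    xi = [sum(r[i] * d for r, d in zip(rows, targets)) for i in range(n_taps)]
    return solve_linear(R, xi)
```

The dense solve costs on the order of n_taps cubed operations, which is the cost that the citing authors describe as problematic for large numbers of taps.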

“…The vectors α = [α(1), α(2), α(3)] and ρ = [ρ(1), ρ(2), ρ(3)] are chosen to obtain the best cancellation performance. Other dependencies α_t on ε² and E_y can also be used.…”

Section: Double-talk Detection (mentioning)

confidence: 99%

“…The Fast AP algorithm has been proposed [2], but it demonstrates numerical instability and it is sensitive to noise. Good cancellation performance is achieved by using the least squares (LS) block approach [3]. However, even with the Toeplitz approximation of normal equations in the LS problem and use of the computationally efficient Levinson algorithm, the complexity of the echo canceler is still high.…”

Section: Introduction (mentioning)

confidence: 99%
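The Toeplitz approximation and Levinson algorithm mentioned in the statement above can be sketched as follows. This is an illustrative sketch only, assuming a real, symmetric, positive definite Toeplitz matrix defined by its first column `r` (autocorrelation lags 0 to n-1) and a right-hand side `b` (the cross-correlation vector). The function name `levinson_solve` is hypothetical, and the code is a textbook Levinson recursion, not the implementation used in the cited papers.

```python
def levinson_solve(r, b):
    """Solve T h = b where T[i][j] = r[abs(i - j)] is symmetric Toeplitz.

    Uses the Levinson recursion, which needs on the order of n**2
    operations instead of the n**3 of a general dense solver.
    """
    n = len(b)
    a = [1.0]             # forward predictor, a[0] is always 1
    err = float(r[0])     # prediction error energy for the current order
    h = [b[0] / err]      # solution of the leading 1x1 system
    for m in range(1, n):
        # Levinson-Durbin step: raise the predictor order from m-1 to m.
        delta = sum(a[i] * r[m - i] for i in range(m))
        k = -delta / err  # reflection coefficient
        ext = a + [0.0]
        a = [ext[i] + k * ext[m - i] for i in range(m + 1)]
        err *= 1.0 - k * k
        # Extend the solution from size m to size m+1 by adding a multiple
        # of the reversed predictor, which only changes the last equation.
        mismatch = b[m] - sum(h[i] * r[m - i] for i in range(m))
        gain = mismatch / err
        h = [hi + gain * a[m - i] for i, hi in enumerate(h + [0.0])]
    return h
```

For example, `levinson_solve([4.0, 1.0, 0.5], [2.25, -6.5, 0.5])` solves the 3 by 3 system whose exact solution is h = [1, -2, 0.5]. Even at quadratic cost per block, the citing authors still regard the overall complexity as high.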

Spectral Domain B-spline Identification in Acoustic Echo Cancellation

Zakharov

Tozer

Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.

Spectral domain B-spline identification is proposed for acoustic echo cancellation. Two approaches are considered. The first is based on solution of normal equations; we describe an efficient technique for such a solution, which benefits from the sparseness of the system matrix due to B-splines. The second approach is based on using local splines, enabling further simplification. We also show how the proposed techniques can be used for efficient double-talk detection. The echo cancellation performance and complexity of the proposed techniques are compared with that of a low-complexity cross-spectral technique and the affine projection (AP) algorithm possessing high cancellation performance. The B-spline identification allows cancellation performance comparable with that of the AP algorithm and complexity close to that of the cross-spectral algorithm.

Ubiquitous speech communication interface

Juang

IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU '01.
