Adaptation techniques for speech recognition are very effective in single-speaker scenarios. However, when distant microphones capture overlapping speech from multiple speakers, conventional speaker adaptation methods are less effective: the putative signal for any given speaker contains interference from the others, so any adaptation technique also adapts the model to the interfering speakers, which degrades recognition performance for the desired speaker. In this work, we develop a new feature-space adaptation method for overlapping speech. We first build a beamformer to enhance the speech of each active speaker, then compute speech feature vectors from each beamformer's output. Finally, we jointly transform the feature vectors of all speakers so as to maximize the likelihood of their respective acoustic models. Experiments on the speech separation challenge data collected under the AMI project demonstrate the effectiveness of our adaptation method: with delay-and-sum beamforming, it achieves an absolute word error rate (WER) reduction of up to 14%, and with minimum mutual information (MMI) beamforming, it achieves a WER of 31.5%. To the best of our knowledge, this is the lowest WER reported on this task.
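To illustrate the first stage of the pipeline, a minimal delay-and-sum beamformer can be sketched as follows. This is not the implementation used in the paper: the function and variable names are illustrative, and the sketch assumes integer-sample steering delays (real arrays require fractional-delay filtering) and a single known steering direction per speaker.

```python
import numpy as np

def delay_and_sum(signals, delays, fs):
    """Delay-and-sum beamformer with integer-sample steering delays.

    signals: (n_mics, n_samples) array of time-aligned microphone recordings
    delays:  per-microphone steering delays in seconds (negative values
             advance a channel, compensating a later arrival at that mic)
    fs:      sampling rate in Hz
    """
    n_mics, n_samples = signals.shape
    out = np.zeros(n_samples)
    for m in range(n_mics):
        shift = int(round(delays[m] * fs))   # convert delay to samples
        out += np.roll(signals[m], shift)    # align this channel
    return out / n_mics                      # average the aligned channels
```

Steering the array toward one speaker adds that speaker's signal coherently while interfering speakers, arriving from other directions, are summed incoherently and attenuated; the beamformer output is then fed to the feature extraction front end.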