Mismatched training and testing conditions for speaker identification exist when speech is subjected to a different channel for the two cases. This results in diminished speaker identification performance. Finding features that show little variability to the filtering effect of different channels will make speaker identification systems more robust thereby achieving a better performance. It has been shown that subtracting the mean of the pole filtered linear predictive (LP) cepstrum from the actual LP cepstrum results in a robust feature. This feature is known as the pole filtered mean removed LP cepstrum. Another robust feature is the adaptive component weighted (ACW) cepstrum particularly with mean removal. In this paper, we combine the ACW cepstrum with the pole filtering concept to configure a more robust new feature, namely, the pole filtered mean removed ACW cepstrum. This new method is fast and shows a higher performance then the pole filtered mean removed LP cepstrum and the mean removed ACW cepstrum. Experimental results are given for the TIMIT database involving a variety of mismatched conditions.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.