Peiyun Xue scite author profile

Environmental noise can pose a threat to the stable operation of current speech recognition systems. It is therefore essential to develop a front feature set that is able to identify speech under low signalto-noise ratio. In this paper, a robust fusion feature is proposed that can fully characterize speech information. To obtain the cochlear filter cepstral coefficients (CFCC), a novel feature is first extracted by the power-law nonlinear function, which can simulate the auditory characteristics of the human ear. Speech enhancement technology is then introduced into the front end of feature extraction, and the extracted feature and their first-order difference are combined in new mixed features. An energy feature Teager energy operator cepstral coefficient (TEOCC) is also extracted, and combined with the above-mentioned mixed features to form the fusion feature sets. Principal component analysis (PCA) is then applied to feature selection and optimization of the feature set, and the final feature set is used in a non-specific persons, isolated words, and smallvocabulary speech recognition system. Finally, a comparative experiment of speech recognition is designed to verify the advantages of the proposed feature set using a support vector machine (SVM). The experimental results show that the proposed feature set not only display a high recognition rate and excellent anti-noise performance in speech recognition, but can also fully characterize the auditory and energy information in the speech signals.INDEX TERMS Cochlear filter cepstral coefficients, Teager energy operators cepstral coefficients, principal component analysis, speech recognition.

show abstract

Research progress of diosgenin extraction from Dioscorea zingiberensis C. H. Wright: Inspiration of novel method with environmental protection and efficient characteristics

Zhang

Guo

Xue

et al. 2023

Steroids

View full text Add to dashboard Cite

Acoustic and kinematic analyses of Mandarin vowels in speakers with hearing impairment

Xue

Zhang

Bai

et al. 2017

Clinical Linguistics & Phonetics

View full text Add to dashboard Cite

The central aim of this experiment was to compare acoustic parameters, formant frequencies and vowel space area (VSA), in adolescents with hearing-impaired (HI) and their normal-hearing (NH) peers; for kinematic parameters, the movements of vocal organs, especially the lips, jaw and tongue, during vowel production were analysed. The participants were 12 adolescents with different degrees of hearing impairment. The control group consisted of 12 age-matched NH adolescents. All participants were native Chinese speakers who were asked to produce the Mandarin vowels /a/, /i/ and /u/, with subsequent acoustic and kinematic analysis. There was significant difference between the two groups. Additionally, the HI group produced more exaggerated mouth and less tongue movements in all vowels, compared to their NH peers. Results were discussed regarding possible relationship between acoustic data, articulatory movements and degree of hearing loss to provide an integrative assessment of acoustic and kinematic characteristics of individuals with hearing loss.

show abstract

Parameters Optimization and Application Research of v-Support Vector Machine

Bai¹,

Xue²,

Zhang³

et al. 2013

IJACT

View full text Add to dashboard Cite

Anti-noise Speech Recognition System Based on Improved MFCC Features and Wavelet Kernel SVM

Bai¹,

Xue²,

Zhang³

et al. 2012

AISS

View full text Add to dashboard Cite

Parameters Optimization and Application of v-Support Vector Machine Based on Particle Swarm Optimization Algorithm

Bai

Zhang

Xue

et al. 2012

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Peiyun Xue

Analysis and classification of the nasal finals in hearing-impaired patients using tongue movement features

Fusion Feature Extraction Based on Auditory and Energy for Noise-Robust Speech Recognition

Research progress of diosgenin extraction from Dioscorea zingiberensis C. H. Wright: Inspiration of novel method with environmental protection and efficient characteristics

Acoustic and kinematic analyses of Mandarin vowels in speakers with hearing impairment

Parameters Optimization and Application Research of v-Support Vector Machine

Anti-noise Speech Recognition System Based on Improved MFCC Features and Wavelet Kernel SVM

Parameters Optimization and Application of v-Support Vector Machine Based on Particle Swarm Optimization Algorithm

Contact Info

Product

Resources

About