2006
DOI: 10.1016/j.specom.2006.06.008
|View full text |Cite
|
Sign up to set email alerts
|

A feature extraction method using subband based periodicity and aperiodicity decomposition with noise robust frontend processing for automatic speech recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2010
2010
2022
2022

Publication Types

Select...
7

Relationship

1
6

Authors

Journals

citations
Cited by 17 publications
(7 citation statements)
references
References 22 publications
0
7
0
Order By: Relevance
“…However, the ability to hear tones in noise has important implications for speech intelligibility under noisy conditions (Plomp, 1994). In support of this, the detection of common periodicity has been successfully used to segment parts of noisy speech signals that contain vowel sounds (Hu and Wang, 2008), and the use of periodicity-based features was found to enhance the automatic speech recognition rates of voiced components of speech in the presence of noise (Ishizuka & Nakatani, 2006). Other researchers have considered the role of periodicity processing in segregating vowels by autocorrelation and similar algorithms such as recurrent neural networks (Assmann and Summerfield, 1990;Cariani, 2001;de Cheveigné and Kawahara, 1999;Meddis and Hewitt, 1992), and in the formation of stable auditory images of periodic sounds through the cross-channel temporal alignment of the maxima of auditory nerve spike rates (Patterson et al, 1995).…”
Section: Introductionmentioning
confidence: 93%
“…However, the ability to hear tones in noise has important implications for speech intelligibility under noisy conditions (Plomp, 1994). In support of this, the detection of common periodicity has been successfully used to segment parts of noisy speech signals that contain vowel sounds (Hu and Wang, 2008), and the use of periodicity-based features was found to enhance the automatic speech recognition rates of voiced components of speech in the presence of noise (Ishizuka & Nakatani, 2006). Other researchers have considered the role of periodicity processing in segregating vowels by autocorrelation and similar algorithms such as recurrent neural networks (Assmann and Summerfield, 1990;Cariani, 2001;de Cheveigné and Kawahara, 1999;Meddis and Hewitt, 1992), and in the formation of stable auditory images of periodic sounds through the cross-channel temporal alignment of the maxima of auditory nerve spike rates (Patterson et al, 1995).…”
Section: Introductionmentioning
confidence: 93%
“…This principle was first explored successfully in the architecture of deep auto encoder on the "raw" spectrogram or linear filter-bank features, [56] showing its superiority over the Mel-Cepstral features which contain a few stages of fixed transformation from spectrograms. The true "raw" features of speech, waveforms, have more recently been shown to produce excellent larger-scale speech recognition results [57].…”
Section: -2017smentioning
confidence: 99%
“…1. The outcomes of auditory comb filter hypothesis [12] and SPADE analysis in noisy speech recognition [11,14] have strongly inspired to develop this technique. The hypothesis can be implemented in WERB-SPADE by using comb filters and ERB like WP decomposition of acoustic speech signal.…”
Section: Sub-band Periodic and Aperiodic Decompositionmentioning
confidence: 99%
“…They have evaluated the performance of SPADE with AURORA-2J database in the presence of noise and claimed that proposed features have outperformed MFCC. Later Ishizuka and Nakatani [14] have expanded the SPADE analysis in frequency domain and have proposed new feature extraction technique named SPADE freQUEncy domain ENhancement (SPADE-QUEEN). They combined their proposed front end technique with different noise compensation techniques such as, spectral subtraction or Wiener filtering and studied the performance of robust front end technique with AURORA 2J database.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation