“…This method has been compared to a dataset of ground truth formant values obtained from manually derived and corrected formant trajectories. The results show that the formant trajectory estimation method produces highly accurate and reliable results [35,36]. Further evaluations of this method and comparison of its performance with other formant estimation methods are presented in [37,38].…”
“…This method has been compared to a dataset of ground truth formant values obtained from manually derived and corrected formant trajectories. The results show that the formant trajectory estimation method produces highly accurate and reliable results [35,36]. Further evaluations of this method and comparison of its performance with other formant estimation methods are presented in [37,38].…”
“…In this section, the importance of each formant in conveying accent is assessed. The influences of four formants on accents are ranked according to a normalized distance measure as shown in equation (1) [3]. This formula is proposed for ranking the formants of an accent A with reference to the formants of another accent B , as Equation (1) is used to obtain an estimate of the ranks of the contribution of formants to pairs of accents.…”
Section: Sensitive Frequency Analysismentioning
confidence: 99%
“…Usually, gender-dependent model is used to deal with the gender variability problem. The accent variability related to the differences of phonemic systems, phonotactic distributions, lexical realizations of words, phonetic realizations and so on is a more complex and challenging question [3].…”
Abstract.It is well known that performance of Chinese speech recognition system fluctuates sharply with variation of accents. Hence it is feasible to identify accents before recognition. Based on our previous analysis [1], it is discovered that the first two formants are more sensitive to Chinese accents than others. A modified Mel-frequency cepstral coefficients (MFCC) algorithm is proposed by increasing the filter distribution in lower and middle frequency range to accommodate the sensitivity of Chinese accents. Comparing with the GMM system based on traditional MFCC, the error rate of the GMM system based on modified MFCC declines by 1.8%.
“…A relatively simple formula (Yan et al, 2003) for ranking the influence of formants in characterizing the voice of a speaker A with reference to the formants of another speaker B, or that of standard voice, is proposed as…”
Section: Relative Influence Of Formant Parameters On Voicementioning
This paper presents a method for the estimation and mapping of parametric models of speech resonance at formants for voice conversion. The spectral features at formants that contribute to voice characteristics are the trajectories of the frequencies, the bandwidths and intensities of the resonance at formants. The formant features are extracted from the poles of a linear prediction (LP) model of speech. The statistical distributions of formants are modelled by a two-dimensional hidden Markov model (HMM) spanning the time and frequency dimensions. Experimental results are presented which show a close match between HMM-based formant models and the histograms of formants. For voice conversion two alternative methods are explored for mapping the formants of a source speaker to those of a target speaker. The first method is based on an adaptive formant-tracking warping of the frequency response of the LP model and the second method is based on the rotation of the poles of the LP model of speech. Both methods transform all spectral parameters of the resonance at formants of the source speaker towards those of the target speaker. In addition, the issues affecting the selection of the warping ratios for the mapping functions are investigated. Experimental results of formant estimation and perceptual evaluation of voice morphing based on parametric formant models are presented.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.