2007
DOI: 10.1007/s00521-007-0151-5
|View full text |Cite
|
Sign up to set email alerts
|

Nonlinear normalization of input patterns to speaker variability in speech recognition neural networks

Abstract: The issue of input variability resulting from speaker changes is one of the most crucial factors influencing the effectiveness of speech recognition systems. A solution to this problem is adaptation or normalization of the input, in a way that all the parameters of the input representation are adapted to that of a single speaker, and a kind of normalization is applied to the input pattern against the speaker changes, before recognition. This paper proposes three such methods in which some effects of the speake… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
11
0

Year Published

2009
2009
2016
2016

Publication Types

Select...
6

Relationship

3
3

Authors

Journals

citations
Cited by 15 publications
(11 citation statements)
references
References 6 publications
0
11
0
Order By: Relevance
“…The ratio of true distinctions in test data to all test data is introduced as accuracy of prediction. In the current study, comparison measurement was based on proper decrease of error and accuracy of prediction in test villages (12).…”
Section: Artificial Neural Network Modelingmentioning
confidence: 99%
“…The ratio of true distinctions in test data to all test data is introduced as accuracy of prediction. In the current study, comparison measurement was based on proper decrease of error and accuracy of prediction in test villages (12).…”
Section: Artificial Neural Network Modelingmentioning
confidence: 99%
“…Moreover, this projection is similar to the tonotopic organization of the human peripheral auditory system [43]. Therefore, it has been used in many ANN structures for speech recognition in diverse researches [37,43,44].…”
Section: Modular Deep Neural Network (Mdnn)mentioning
confidence: 91%
“…Although indicated feature space transformations improve the recognition performance of DNNs, they are linear and cannot deal with nonlinear variations of the speech signal. In the nonlinear feature normalization method proposed in [37], a feed-forward neural network is first trained to map the input representations into both phonetic and speaker codes. Then, a training speaker with the highest phone accuracy is considered as the reference speaker.…”
Section: Previous Work On Using Dnns In Speech Recognitionmentioning
confidence: 99%
See 1 more Smart Citation
“…This kind of neural network has also been successfully applied in clinical outcome prediction of myocardial infarction, mortality, surgical decision making on traumatic brain injury patients, recovery from surgery, pediatric, genecology, head trauma, and transplantation [7][8][9][10][11][12][13][14][15][16].…”
Section: Introductionmentioning
confidence: 99%