2012
DOI: 10.5923/j.ajis.20120201.01
|View full text |Cite
|
Sign up to set email alerts
|

A Comparitive Survey of ANN and Hybrid HMM/ANN Architectures for Robust Speech Recognition

Abstract: This paper proposes two hybrid connectionist structural acoustical models for robust context independent phone like and word like units for speaker-independent recognition system. Such structure combines strength of Hidden Markov Models (HMM) in modeling stochastic sequences and the non-linear classification capability of Artificial Neural Networks (ANN). Two kinds of Neural Networks (NN) are investigated: Multilayer Perceptron (MLP) and Elman Recurrent Neural Networks (RNN). The hybrid connectionist-HMM syste… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2013
2013
2020
2020

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(4 citation statements)
references
References 22 publications
0
4
0
Order By: Relevance
“…A new approach towards high performance speech/music discrimination on realistic tasks related to the automatic transcription of broadcast news is described in (Frikha and Hamida, 2012), in which an Artificial Neural Network (ANN) and HIDDEN Markov Model (HMM) are used. Subashini et al (2012), a generic audio classification and segmentation approach for multimedia indexing and retrieval is described.…”
Section: Related Workmentioning
confidence: 99%
“…A new approach towards high performance speech/music discrimination on realistic tasks related to the automatic transcription of broadcast news is described in (Frikha and Hamida, 2012), in which an Artificial Neural Network (ANN) and HIDDEN Markov Model (HMM) are used. Subashini et al (2012), a generic audio classification and segmentation approach for multimedia indexing and retrieval is described.…”
Section: Related Workmentioning
confidence: 99%
“…Researchers around the world are still trying to build various methods and algorithms that are robust and have high accuracy in speech recognition. Some research on speech recognition includes Speech recognition with artificial neural networks with the method of voice recognition with Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques [1], Voice recognition using Hidden Markov Mode [2] which results in accuracy up to 86.67%, Research speech recognition by combining the Artificial Neural Network method with Hidden Markov Model [3], Hindi voice recognition with Hidden Markov Model [4], Voice recognition for the biometric field with the Vector Quantization method [5]. Research on voice recognition in the groundwater was also carried out using Mel-Frequency Cepstrum Coefficients (MFCC) and Adaptive Neuro-Fuzzy Inferense System (ANFIS) resulting in an accuracy rate of 95.90% [6].…”
Section: Introductionmentioning
confidence: 99%
“…This is due to the efficiency of the HMMs to model the variation in the statistical properties of speech, both in the time and the frequency domains [40]. …”
Section: Introductionmentioning
confidence: 99%
“…HMMs were considered for this work because these are the most frequent techniques used for recognition of normal and disordered speech. This is due to the efficiency of the HMMs to model the variation in the statistical properties of speech, both in the time and the frequency domains [ 40 ].…”
Section: Introductionmentioning
confidence: 99%