Literature Review on Automatic Speech Recognition

Ghai, Wiqas; Singh, Navdeep

doi:10.5120/5565-7646

Cited by 48 publications

(24 citation statements)

References 24 publications

(18 reference statements)

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…In testing stage, the test sample is unknown and the acoustic analysis is performed. In order to discuss the stages involved in ASR architecture [6] [10], it is very important to be aware of the databases that serve as input to the ASR system. Figure 6 shows the architecture of ASR system.…”

Section: B Architecture Of Automatic Speech Recognition Systemmentioning

confidence: 99%

An Extensive Review of Feature Extraction Techniques, Challenges and Trends in Automatic Speech Recognition

Kanabur¹,

Harakannanavar²,

Torse³

2019

IJIGSP

View full text Add to dashboard Cite

Speech is the natural mode of communication between humans. Human-to-machine interaction is gaining importance in the past few decades which demands the machine to be able to analyze, respond and perform tasks at the same speed as performed by human. This task is achieved by Automatic Speech Recognition (ASR) system which is typically a speech-to-text converter. In order to recognize the areas of further research in ASR, one must be aware of the current approaches, challenges faced by each and issues that needs to be addressed. Therefore, in this paper human speech production mechanism is discussed. The various speech recognition techniques and models are addressed in detail. The performance parameters that measure the accuracy of the system in recognizing the speech signal are described.

show abstract

Section: B Architecture Of Automatic Speech Recognition Systemmentioning

confidence: 99%

An Extensive Review of Feature Extraction Techniques, Challenges and Trends in Automatic Speech Recognition

Kanabur¹,

Harakannanavar²,

Torse³

2019

IJIGSP

View full text Add to dashboard Cite

show abstract

“…Wiqas Ghai and Navdeep Singh continued the work further based on the different approaches followed i.e. acoustic-phonetic, pattern recognition, Knowledge Connectionist approach etc [20]. Vadwala et.al.…”

Section: Literature Reviewmentioning

confidence: 99%

Acoustics Speech Processing of Sanskrit Language

Kakodkar¹,

Borkar²

2018

IJCA

View full text Add to dashboard Cite

Speech processing (SP) is the latest trend in technology. An intelligent and precise human-machine interaction (HMI) is designed to engineer an automated, smart and secure application for household and commercial application. The existing methods highlight the absence of the speech processing in the under-resourced languages. The novelty of this work is that it presents a study of acoustic speech processing (ASP) using spectral components of Mel frequency cepstrum coefficient (MFCC) of Sanskrit language. A customized speech database is created as no generic database is available in Sanskrit. The processing method includes speech signal isolation, feature selection and extraction of selected features for applications. The speech is processed over a custom dataset consisting of Sanskrit speech corpus. The spectral features are calculated over 13 coefficients providing improved performance. The results obtained highlight the performance of the proposed system with the variation of the lifter parameter.

show abstract

“…Les SRAP comportent deux modules, un module d'extraction des paramètres acoustiques et un module de décodage [5]. Le module d'extraction de paramètres permet de convertir le signal de la parole en des vecteurs acoustiques [6].…”

Section: Introductionunclassified

A study of speech recognition system based on the Hidden Markov Model with Gaussian-Mixture

Hazem

Zouhir

Ouni

2014

2014 International Conference on Electrical Sciences and Technologies in Maghreb (CISTEM)

View full text Add to dashboard Cite

In this paper, we present a study of isolated word speech recognition system. The adopted system is based on the Hidden Markov Model with Gaussian Mixture (HMM-GM). We studied the recognition rate by varying the states number (3, 4, 5, 6 and 7 states) and the number of Gaussians per state (2, 4, 8, 12, 14 and 16 Gaussians) of Hidden Markov Model. We evaluated these recognition rates using two parameterization techniques Mel Frequency Cepstral Coefficients (MFCC) and Perceptual Linear Prediction (PLP). We have introduced the dynamic coefficients and the energy of the signal in order to achieve an improvement in the recognition rate. Mots clés -Systèmes de reconnaissance automatique de la parole ; Modèle de Markov Caché Multi-Gaussiennes; base TIMIT ; boite à outils HTK.I.

show abstract

Literature Review on Automatic Speech Recognition

Cited by 48 publications

References 24 publications

An Extensive Review of Feature Extraction Techniques, Challenges and Trends in Automatic Speech Recognition

An Extensive Review of Feature Extraction Techniques, Challenges and Trends in Automatic Speech Recognition

Acoustics Speech Processing of Sanskrit Language

A study of speech recognition system based on the Hidden Markov Model with Gaussian-Mixture

Contact Info

Product

Resources

About