Arabic speech recognition using SPHINX engine

Hyassat, Hussein; Zitar, Raed Abu

doi:10.1007/s10772-008-9009-1

Cited by 48 publications

(26 citation statements)

References 17 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…An Arabic ASR system was developed using sphinx toolkit in (Hyassat and Zitar 2006) [1] . Three Arabic corpuses HQC-1 -Holly Qura'an Corpus (18.5 hours), CAC1 -command and control corpus (1.5 hours) and ADC -Arabic digits corpus (less than 1 hour) were created for testing and training of the system.…”

Section: Related Workmentioning

confidence: 99%

“…Transcription is the representation of any language in written form or mapping of spoken words onto written words [1].…”

Section: Transcription Filementioning

confidence: 99%

“…These words are called as non-speech sounds. Filler dictionary must always contain these three entries <sil>, <s>, and </s> [1]. It may also contain non speech-sounds such as um, uh and breath noise sounds made during the speech.…”

Section: Building Kashmiri and Filler Dictionarymentioning

confidence: 99%

See 2 more Smart Citations

An experimental framework of speaker independent speech recognition system for Kashmiri language (K-ASR) system using Sphinx

Bhardwaj

Kadyan²,

Singh³

et al. 2017

ijetst

View full text Add to dashboard Cite

Abstract. Speech to text conversion in various languages have been performed so far but no process has defined for the Kashmiri language. There has been no research done on Kashmiri speech recognition. So in this work, we describe the development as well as implementation of first CMU Sphinx-3 based speech recognizer for the Kashmiri language. Recognition of the words have been done by using hidden markov models (HMMs). Dictionary consists of 100 words, representing Kashmiri digits from one (akh) to hundred (hat).Here, we developed a speaker independent, Kashmiri -Automatic Speech Recognition (K-ASR) system. The System is trained and tested for 1200 words spoken by 12 male and female speakers. Maximum Accuracy of 78.33% was achieved by the K-ASR system.

show abstract

Section: Related Workmentioning

confidence: 99%

“…Transcription is the representation of any language in written form or mapping of spoken words onto written words [1].…”

Section: Transcription Filementioning

confidence: 99%

See 1 more Smart Citation

An experimental framework of speaker independent speech recognition system for Kashmiri language (K-ASR) system using Sphinx

Bhardwaj

Kadyan²,

Singh³

et al. 2017

ijetst

View full text Add to dashboard Cite

show abstract

“…Their method reduced the WER to 9.81% for a diacritized transcription corpus, as they have reported. (Hyassat & Abu Zitar, 2008) described an Arabic speech recognition system based on Sphinx 4. Three corpora were developed, namely, the Holy Qura'an corpus of about 18.5 hours, the command and control corpus of about 1.5 hours, and the Arabic digits corpus of less than 1 hour of speech.…”

Section: Literature and Recent Workmentioning

confidence: 99%

Cross-Word Arabic Pronunciation Variation Modeling Using Part of Speech Tagging

AbuZeina¹,

Al-Muhtaseb²,

Elshafei³

2012

Modern Speech Recognition Approaches With Case Studies

View full text Add to dashboard Cite

“…In this work, an HMM-based public domain speech recognition development toolkit CMU sphinx (Hyassat & Abu Zitar 2008) is used for signal processing and acoustic modelling.…”

Section: Introductionmentioning

confidence: 99%

Connected digit speech recognition system for Malayalam language

Kurian

Balakrishnan

2013

Sadhana

View full text Add to dashboard Cite

Abstract.A connected digit speech recognition is important in many applications such as automated banking system, catalogue-dialing, automatic data entry, automated banking system, etc. This paper presents an optimum speaker-independent connected digit recognizer for Malayalam language. The system employs Perceptual Linear Predictive (PLP) cepstral coefficient for speech parameterization and continuous density Hidden Markov Model (HMM) in the recognition process. Viterbi algorithm is used for decoding. The training data base has the utterance of 21 speakers from the age group of 20 to 40 years and the sound is recorded in the normal office environment where each speaker is asked to read 20 set of continuous digits. The system obtained an accuracy of 99.5 % with the unseen data.

show abstract

Arabic speech recognition using SPHINX engine

Cited by 48 publications

References 17 publications

An experimental framework of speaker independent speech recognition system for Kashmiri language (K-ASR) system using Sphinx

An experimental framework of speaker independent speech recognition system for Kashmiri language (K-ASR) system using Sphinx

Cross-Word Arabic Pronunciation Variation Modeling Using Part of Speech Tagging

Connected digit speech recognition system for Malayalam language

Contact Info

Product

Resources

About