2006
DOI: 10.1007/s10772-008-9009-1
|View full text |Cite
|
Sign up to set email alerts
|

Arabic speech recognition using SPHINX engine

Abstract: Although the Arab world has an estimated number of 250 million Arabic speakers, there has been little research on Arabic speech recognition when compared to other languages of similar importance (e.g. Mandarin). Due to the lack of diacritic Arabic text and the lack of Pronunciation Dictionary (PD), most of previous work on Arabic Automatic Speech Recognition has been concentrated on developing recognizers using Romanized characters i.e. let the system recognizes the Arabic word as an English one, then map it t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
26
0

Year Published

2012
2012
2023
2023

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 48 publications
(26 citation statements)
references
References 17 publications
0
26
0
Order By: Relevance
“…An Arabic ASR system was developed using sphinx toolkit in (Hyassat and Zitar 2006) [1] . Three Arabic corpuses HQC-1 -Holly Qura'an Corpus (18.5 hours), CAC1 -command and control corpus (1.5 hours) and ADC -Arabic digits corpus (less than 1 hour) were created for testing and training of the system.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…An Arabic ASR system was developed using sphinx toolkit in (Hyassat and Zitar 2006) [1] . Three Arabic corpuses HQC-1 -Holly Qura'an Corpus (18.5 hours), CAC1 -command and control corpus (1.5 hours) and ADC -Arabic digits corpus (less than 1 hour) were created for testing and training of the system.…”
Section: Related Workmentioning
confidence: 99%
“…Transcription is the representation of any language in written form or mapping of spoken words onto written words [1].…”
Section: Transcription Filementioning
confidence: 99%
See 1 more Smart Citation
“…Their method reduced the WER to 9.81% for a diacritized transcription corpus, as they have reported. (Hyassat & Abu Zitar, 2008) described an Arabic speech recognition system based on Sphinx 4. Three corpora were developed, namely, the Holy Qura'an corpus of about 18.5 hours, the command and control corpus of about 1.5 hours, and the Arabic digits corpus of less than 1 hour of speech.…”
Section: Literature and Recent Workmentioning
confidence: 99%
“…In this work, an HMM-based public domain speech recognition development toolkit CMU sphinx (Hyassat & Abu Zitar 2008) is used for signal processing and acoustic modelling.…”
Section: Introductionmentioning
confidence: 99%