2007 IEEE Workshop on Automatic Speech Recognition &Amp; Understanding (ASRU) 2007
DOI: 10.1109/asru.2007.4430154
|View full text |Cite
|
Sign up to set email alerts
|

Advances in Arabic broadcast news transcription at RWTH

Abstract: This paper describes the RWTH speech recognition system for Arabic. Several design aspects of the system, including cross-adaptation, multiple system design and combination, are analyzed. We summarize the semi-automatic lexicon generation for Arabic using a statistical approach to grapheme-tophoneme conversion and pronunciation statistics. Furthermore, a novel ASR-based audio segmentation algorithm is presented. Finally, we discuss practical approaches for parallelized acoustic training and memory efficient la… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
14
0

Year Published

2008
2008
2023
2023

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 14 publications
(14 citation statements)
references
References 11 publications
0
14
0
Order By: Relevance
“…Many developments have been done in the field of voice recognition [9] and mostly all the developed proposed methods are based on: Voice analysis, feature extraction, modeling and matching [10]. These methods [11], [12], [13] and [14] are mostly suffering from the efforts needed to perform the process of voice recognition and the lowest recognition ratio. A method proposed in [15] is a good example of one of the current development which is based on extracting voice features based on voice analysis to calculate some parameters such as estimated population (mu), dynamic range, peak factor, Power spectral density, and zero crossing rate.…”
Section: Fig 2: Histogram Of the Birdwavmentioning
confidence: 99%
“…Many developments have been done in the field of voice recognition [9] and mostly all the developed proposed methods are based on: Voice analysis, feature extraction, modeling and matching [10]. These methods [11], [12], [13] and [14] are mostly suffering from the efforts needed to perform the process of voice recognition and the lowest recognition ratio. A method proposed in [15] is a good example of one of the current development which is based on extracting voice features based on voice analysis to calculate some parameters such as estimated population (mu), dynamic range, peak factor, Power spectral density, and zero crossing rate.…”
Section: Fig 2: Histogram Of the Birdwavmentioning
confidence: 99%
“…The first two bnad06 and bcad06 were defined by BBN technology and consist of about 3 hours of BN and BC data respectively (collected during Dec05-Jan06). These test sets comprise com- 3 For the SPron system this only distinguished between a silence model being at the end of a word or not. …”
Section: Multi-pass Combination Frameworkmentioning
confidence: 99%
“…As part of the training process it is necessary to obtain pronunciations for words that can not be handled by Buckwalter [4,3]. A series of rules were automatically generated from a 250K Buckwalter derived phonetic dictionary.…”
Section: Automatic Pronunciation Generationmentioning
confidence: 99%
See 2 more Smart Citations