2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07 2007
DOI: 10.1109/icassp.2007.367310
|View full text |Cite
|
Sign up to set email alerts
|

Speech Recognition System Combination for Machine Translation

Abstract: The majority of state-of-the-art speech recognition systems make use of system combination. The combination approaches adopted have traditionally been tuned to minimising Word Error Rates (WERs). In recent years there has been growing interest in taking the output from speech recognition systems in one language and translating it into another. This paper investigates the use of cross-site combination approaches in terms of both WER and impact on translation performance. In addition the stages involved in modif… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
1

Year Published

2007
2007
2018
2018

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(9 citation statements)
references
References 11 publications
0
8
1
Order By: Relevance
“…However, the system combination yields slightly better results. These results differ from those in [21], although different tasks and error metrics were used.…”
Section: Resultscontrasting
confidence: 80%
“…However, the system combination yields slightly better results. These results differ from those in [21], although different tasks and error metrics were used.…”
Section: Resultscontrasting
confidence: 80%
“…When component systems use different word segmentation schemes, a direct combination between their outputs is problematic, for example, in Chinese where different character to word segmentations are used. Hence, for the Mandarin speech recognition tasks considered here, the most successful approach is to perform a character level combination, [3][4][5]11 as is also considered in this paper. This requires the mapping of outputs from a standard word based system to sub-word, character level.…”
Section: Hypothesis Level System Combinationmentioning
confidence: 99%
“…2 Therefore, the character error rate (CER) is the commonly used evaluation metric for state-of-the-art Mandarin speech recognition systems. [3][4][5] All languages have constrained syllable constructions and syllable sequence rules which enhance intelligibility. 6 These phonological and pragmatic constraints can be exploited for Chinese speech recognition.…”
Section: Introductionmentioning
confidence: 99%
“…Concerning the evaluation of text recognition, we rely on standard metrics of speech recognition such as Character Error Rate (CER) and Word Error Rate (WER) for determining the recognition accuracy of the segmentation free OCR output [91]. Note that we measure the quality of the generated pseudo ground-truth only by counting the number of validated text lines because, in our experiment, these data are automatically generated (no manual ground-truth available).…”
Section: Text Recognitionmentioning
confidence: 99%