2003
DOI: 10.1007/3-540-45011-4_2

AUDIMUS.MEDIA: A Broadcast News Speech Recognition System for the European Portuguese Language

Cited by 58 publications (28 citation statements)
References: 5 publications
“…The baseline LM presents, over the test set (March11-B), an average perplexity (PP) of 122, an OOV word rate of 1.29% and a WER of 28.1%. The WER is higher than the normal evaluation test sets [4].…”
Section: New Datasets (citation type: mentioning)
confidence: 73%
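
The statement above reports an OOV word rate and a WER. As a reading aid, here is a minimal sketch of how these two metrics are conventionally computed. It is not the cited authors' evaluation code; the function names `word_error_rate` and `oov_rate` are hypothetical, and whitespace tokenization is an assumption.

```python
def word_error_rate(reference, hypothesis):
    """WER = word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference prefix seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(ref)


def oov_rate(words, vocabulary):
    """Fraction of running words that are missing from the recognizer vocabulary."""
    if not words:
        raise ValueError("words must not be empty")
    return sum(1 for w in words if w not in vocabulary) / len(words)
```

Both functions return fractions; multiply by 100 to compare with percentages such as those quoted in the statement.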
“…For the work presented in this paper, we used the system reported in [4]. This European Portuguese broadcast news transcription system features a hybrid HMM/MLP system, using three MLPs, each of them associated with a different feature extraction process, where the MLPs are used to estimate the context independent posterior phone probabilities given the acoustic data at each frame.…”
Section: Broadcast News Transcrition System (citation type: mentioning)
confidence: 99%
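
The statement above describes three MLPs, each producing per-frame posterior phone probabilities from a different feature stream. The excerpt does not say how the three streams are merged, so the following is only a sketch under an assumed rule: averaging the log posteriors and renormalizing. The name `combine_posteriors` and the flooring constant are hypothetical.

```python
import math


def combine_posteriors(streams, floor=1e-10):
    """Merge per-stream phone posteriors for one frame.

    streams: list of posterior vectors, one per MLP, all of the same length.
    Assumption: streams are combined by averaging log posteriors; the source
    excerpt does not specify the actual combination rule.
    """
    n_phones = len(streams[0])
    log_avg = [
        sum(math.log(max(s[k], floor)) for s in streams) / len(streams)
        for k in range(n_phones)
    ]
    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(log_avg)
    unnorm = [math.exp(v - peak) for v in log_avg]
    total = sum(unnorm)
    return [u / total for u in unnorm]
```

In a hybrid HMM/MLP decoder, posteriors like these are commonly divided by phone priors to obtain scaled likelihoods before decoding; that step is omitted here.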
“…The phone recognizer is part of the AUDIMUS system [11], a hybrid recognizer that combines the temporal modeling capabilities of hidden Markov models with the pattern discriminative classification abilities of multi-layer Perceptrons. This phonetic decoding is applied to all the languages in the training database, resulting in Portuguese-phones sequences which are then modeled for each language by n-grams, using the SRI language modeling toolkit [12].…”
Section: PRLM System (citation type: mentioning)
confidence: 99%
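
The statement above outlines phone recognition followed by per-language n-gram modeling. A minimal sketch of that idea follows, assuming phone sequences are already decoded. It swaps in a hand-rolled add-one-smoothed bigram model in place of the SRI language modeling toolkit named in the statement, and all function names are hypothetical.

```python
import math
from collections import Counter


def train_bigram(sequences):
    """Count phone bigrams over a list of phone sequences for one language."""
    bigrams, contexts, phones = Counter(), Counter(), set()
    for seq in sequences:
        padded = ["<s>"] + list(seq)  # "<s>" marks the sequence start
        phones.update(padded)
        for a, b in zip(padded, padded[1:]):
            bigrams[(a, b)] += 1
            contexts[a] += 1
    return bigrams, contexts, phones


def log_prob(seq, model, vocab_size):
    """Add-one-smoothed bigram log probability of a phone sequence."""
    bigrams, contexts, _ = model
    padded = ["<s>"] + list(seq)
    return sum(
        math.log((bigrams[(a, b)] + 1) / (contexts[a] + vocab_size))
        for a, b in zip(padded, padded[1:])
    )


def identify_language(seq, models):
    """Return the language whose bigram model scores the sequence highest."""
    vocab_size = len(set().union(*(m[2] for m in models.values())))
    return max(models, key=lambda lang: log_prob(seq, models[lang], vocab_size))
```

A usage example with toy data: train one model per language with `train_bigram`, collect them in a dict keyed by language name, and pass a decoded phone sequence to `identify_language`.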
“…There are 4 main blocks in this diagram: the ASR, the TTS, the FACE and the TM. The ASR is based on Audimus [5], a hybrid speech recognizer that combines the temporal modeling capabilities of Hidden Markov Models (HMMs) with the pattern discriminative classification capabilities of multilayer perceptrons (MLPs). This same recognizer is being used for different complexity tasks based on a common structure but with different components.…”
Section: Our System (citation type: mentioning)
confidence: 99%
“…This means that to control the devices the user has to start by the keyword "Ambrósio". The acoustic models of our Audimus [5] system are speaker independent.…”
Section: ASR Configuration (citation type: mentioning)
confidence: 99%