2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2015
DOI: 10.1109/icassp.2015.7178869
|View full text |Cite
|
Sign up to set email alerts
|

The THUEE system for the openKWS14 keyword search evaluation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2015
2015
2015
2015

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 24 publications
0
2
0
Order By: Relevance
“…The baseline system S1 uses convolutional maxout neural network acoustic model [14,15] Among them, S1-S10 are based on Kaldi, while S11 is based on HTK. The language model of S1-10 is a word trigram language model, while S11 utilizes a feed-forward neural network language model with variance regularizations [19].…”
Section: Kws Systemsmentioning
confidence: 99%
“…The baseline system S1 uses convolutional maxout neural network acoustic model [14,15] Among them, S1-S10 are based on Kaldi, while S11 is based on HTK. The language model of S1-10 is a word trigram language model, while S11 utilizes a feed-forward neural network language model with variance regularizations [19].…”
Section: Kws Systemsmentioning
confidence: 99%
“…Best performance in the NIST Open KWS 2013 evaluation is ATWV=0.6248 [110] under the Full Language Pack (FullLP) condition, for which 20 h of word-transcribed scripted speech, 80 h of word-transcribed CTS, and a pronunciation lexicon were given to participants. In the works describing systems on the surprise language (i.e., Tamil) of the Open KWS 2014 evaluation [53,92,94,[111][112][113][114][115][116][117], ATWV=0.5802 is the best performance obtained under the FullLP condition, for which 60 h of transcribed speech and a pronunciation lexicon were given to participants.…”
Section: Comparison To Other Evaluationsmentioning
confidence: 99%