2013 IEEE International Conference on Acoustics, Speech and Signal Processing 2013
DOI: 10.1109/icassp.2013.6639280
|View full text |Cite
|
Sign up to set email alerts
|

Exploiting diversity for spoken term detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
46
0

Year Published

2013
2013
2020
2020

Publication Types

Select...
4
2
2

Relationship

1
7

Authors

Journals

citations
Cited by 43 publications
(46 citation statements)
references
References 15 publications
0
46
0
Order By: Relevance
“…However, this work was performed on large-resource languages and most of the effort focused on clean speech. It has recently been demonstrated that significant improvement on STD task can be obtained by deliberately designing diverse and complementary ASR components (i.e., front ends, acoustic models, etc) [2]. We show that similar approach works on noisy speech for lowresource languages with low target false alarm rate.…”
Section: Introductionmentioning
confidence: 89%
See 2 more Smart Citations
“…However, this work was performed on large-resource languages and most of the effort focused on clean speech. It has recently been demonstrated that significant improvement on STD task can be obtained by deliberately designing diverse and complementary ASR components (i.e., front ends, acoustic models, etc) [2]. We show that similar approach works on noisy speech for lowresource languages with low target false alarm rate.…”
Section: Introductionmentioning
confidence: 89%
“…This normalization scheme was proposed for IR data fusion in [9] and showed improvement for meta-search. It was used successfully for the first time in STD in [2]. A variant of scheme was initially investigated for IR in [10].…”
Section: Score Normalization Methodologiesmentioning
confidence: 99%
See 1 more Smart Citation
“…Before MTWV scoring these values were further normalised using a sum-to-one approach which ensures that the sum over the test set of the the scores for each keyword sum to unity. More details of the approach are given in [29].…”
Section: Abstractearch Systemmentioning
confidence: 99%
“…The limited data corresponding to some languages covered in the program (Cantonese, Pashto, Turkish, Tagalog, Vietnamese, Assamese, Bengali, Haitian Creole, Lao, and Zulu) were used for system training. The system is based on multi-lingual bottle-neck DNNs and Hidden Markov Model Toolkit (HTK) [83] for training and decoding and the IBM keyword search system for term detection [84]. Results showed that INV term performance is good for languages (e.g., Haitian Creole) whose phonetic structure is similar to that of the languages used for system training.…”
Section: Spoken Term Detection Under the Iarpa Babel Program And Openmentioning
confidence: 99%