2013 IEEE Workshop on Automatic Speech Recognition and Understanding 2013
DOI: 10.1109/asru.2013.6707766
|View full text |Cite
|
Sign up to set email alerts
|

Using proxies for OOV keywords in the keyword search task

Abstract: We propose a simple but effective weighted finite state transducer (WFST) based framework for handling out-ofvocabulary (OOV) keywords in a speech search task. Stateof-the-art large vocabulary continuous speech recognition (LVCSR) and keyword search (KWS) systems are developed for conversational telephone speech in Tagalog. Word-based and phone-based indexes are created from word lattices, the latter by using the LVCSR system's pronunciation lexicon. Pronunciations of OOV keywords are hypothesized via a standa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
60
0
2

Year Published

2015
2015
2020
2020

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 79 publications
(62 citation statements)
references
References 12 publications
0
60
0
2
Order By: Relevance
“…3, is the same as the Combined Kaldi system, except that the OOV terms were searched with the proxy words strategy of the Kaldi toolkit [55]. This strategy consists of substituting each OOV word of the search term with acoustically similar INV proxy words, getting rid of the need of a subword-based system for OOV term search.…”
Section: Proxy Kaldi Std System (Proxy Kaldi)mentioning
confidence: 99%
See 3 more Smart Citations
“…3, is the same as the Combined Kaldi system, except that the OOV terms were searched with the proxy words strategy of the Kaldi toolkit [55]. This strategy consists of substituting each OOV word of the search term with acoustically similar INV proxy words, getting rid of the need of a subword-based system for OOV term search.…”
Section: Proxy Kaldi Std System (Proxy Kaldi)mentioning
confidence: 99%
“…The OOV terms were handled using the proxy words strategy of the Kaldi toolkit [55]. As for INV terms, a two-pass strategy has been applied: In the first pass, the OOV terms were synthesized and recognized to create proxy word Finite State Transducers (FSTs) that are needed in the Kaldi term detector to produce a first set of OOV term detections.…”
Section: Synthesis and Syllabic Decomposition-based Combined Std Systmentioning
confidence: 99%
See 2 more Smart Citations
“…The keywords, however, are only limited to the words that are already defined in the LVCSR vocabulary. While effort has been made to make such systems keywordindependent [11,12,13,14], there is usually a performance degradation when the keywords are out of vocabulary. In addition, these approaches are relatively expensive because of the LVCSR system.…”
Section: Introductionmentioning
confidence: 99%