2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2018
DOI: 10.1109/icassp.2018.8462166
|View full text |Cite
|
Sign up to set email alerts
|

Time-Delayed Bottleneck Highway Networks Using a DFT Feature for Keyword Spotting

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
33
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 42 publications
(33 citation statements)
references
References 12 publications
0
33
0
Order By: Relevance
“…Eqs. (3) to (7) (with the number of positive targets n=1) and (8) to (9) define the loss for the decoder submodel. ] includes actual end-point of the keyword.…”
Section: Smoothed Max Pooling Loss For Decodermentioning
confidence: 99%
“…Eqs. (3) to (7) (with the number of positive targets n=1) and (8) to (9) define the loss for the decoder submodel. ] includes actual end-point of the keyword.…”
Section: Smoothed Max Pooling Loss For Decodermentioning
confidence: 99%
“…We use 993 utterances from the data. 'Hey Snapdragon' utterances are from a publicly available dataset 1 . There are 50 speakers and each of them speaks the keyword 22 or 23 times.…”
Section: Query and Testing Datamentioning
confidence: 99%
“…They collect numerous variations of a specific keyword utterance and train neural networks (NNs) which have been promising method in the field. [1,2] have acoustic encoder and sequence matching decoder as separate modules. The NN-based acoustic models (AMs) predict senone-level Qualcomm AI Research is an initiative of Qualcomm Technologies, Inc posteriors.…”
Section: Introductionmentioning
confidence: 99%
“…Prior work on FTM mainly comprises of research on key-word spotting and wake-up word detection. VT detection approaches typically rely on multi-stage neural network based processing of acoustic features to determine the presence of the wake-word [1,3,4,5,6]. These approaches often use ASR as an auxiliary task to aid the VT detection task.…”
Section: Introductionmentioning
confidence: 99%