2022
DOI: 10.1109/access.2021.3139508
|View full text |Cite
|
Sign up to set email alerts
|

Deep Spoken Keyword Spotting: An Overview

Abstract: Spoken keyword spotting (KWS) deals with the identification of keywords in audio streams and has become a fast-growing technology thanks to the paradigm shift introduced by deep learning a few years ago. This has allowed the rapid embedding of deep KWS in a myriad of small electronic devices with different purposes like the activation of voice assistants. Prospects suggest a sustained growth in terms of social use of this technology. Thus, it is not surprising that deep KWS has become a hot research topic amon… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
31
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 41 publications
(31 citation statements)
references
References 225 publications
(612 reference statements)
0
31
0
Order By: Relevance
“…As shown in Figure 1 (B), the data collected by model A is used by model B for keyword spotting, and model B is run on the data collected by the model A. A detailed review on keyword spotting can be found in (López-Espejo et al, 2021). Running a keyword spotter on an utterance i will produce a score s i representing how likely it is to have detected the keyword.…”
Section: Ab/ba Analysismentioning
confidence: 99%
“…As shown in Figure 1 (B), the data collected by model A is used by model B for keyword spotting, and model B is run on the data collected by the model A. A detailed review on keyword spotting can be found in (López-Espejo et al, 2021). Running a keyword spotter on an utterance i will produce a score s i representing how likely it is to have detected the keyword.…”
Section: Ab/ba Analysismentioning
confidence: 99%
“…The normal class consist of 5100 original and 45900 augmented samples using various augmentation techniques that were detailed in [66]. However, for this work only 12100 samples of normal class were used to mitigate the issue of imbalance dataset and foul language data samples scarcity as data imbalance and rarity is a major issue for KWS systems [9]. Additionally, the effect of data augmentation has led to improving model's performance and robustness to noise [66].…”
Section: ) Mmutm Datasetmentioning
confidence: 99%
“…Spoken keyword spotting (KWS) is a fast-growing technology due to the increased usage often coupled with deep learning techniques that involves the identification of keywords in audio streams [9]. As a consequence of the rapid growth of human-machine interaction via voice, the social usage of this technology is expected to achieve sustainable growth.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations