Interspeech 2019 2019
DOI: 10.21437/interspeech.2019-2010
|View full text |Cite
|
Sign up to set email alerts
|

Keyword Spotting for Hearing Assistive Devices Robust to External Speakers

Abstract: Keyword spotting (KWS) is experiencing an upswing due to the pervasiveness of small electronic devices that allow interaction with them via speech. Often, KWS systems are speakerindependent, which means that any person -user or notmight trigger them. For applications like KWS for hearing assistive devices this is unacceptable, as only the user must be allowed to handle them. In this paper we propose KWS for hearing assistive devices that is robust to external speakers. A stateof-the-art deep residual network f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
22
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 9 publications
(22 citation statements)
references
References 14 publications
0
22
0
Order By: Relevance
“…In this way, Tang and Lin greatly outperformed, with less parameters, standard CNNs [28] in terms of KWS performance, establishing a new state-of-theart back in 2018. Their powerful deep residual architecture so-called res15 has been employed to carry out different KWS studies in areas like robustness for hearing assistive devices [128], [129], filterbank learning [82], and robustness to acoustic noise [130], among others.…”
Section: B Convolutional Neural Networkmentioning
confidence: 99%
See 4 more Smart Citations
“…In this way, Tang and Lin greatly outperformed, with less parameters, standard CNNs [28] in terms of KWS performance, establishing a new state-of-theart back in 2018. Their powerful deep residual architecture so-called res15 has been employed to carry out different KWS studies in areas like robustness for hearing assistive devices [128], [129], filterbank learning [82], and robustness to acoustic noise [130], among others.…”
Section: B Convolutional Neural Networkmentioning
confidence: 99%
“…( 4). This approach is preferred over picking classes yielding posteriors above a sensitivity (decision) threshold to be set, since experience tells [82], [128]- [130] that non-streaming deep KWS systems tend to produce very peaked posterior distributions. This might be attributed to the fact that nonstreaming systems do not have to deal with inter-class transition data as in the dynamic case (see the next subsection), but with well-defined, isolated class realizations.…”
Section: A Non-streaming Modementioning
confidence: 99%
See 3 more Smart Citations