2020
DOI: 10.1007/978-3-030-63820-7_1
|View full text |Cite
|
Sign up to set email alerts
|

A Hybrid Representation of Word Images for Keyword Spotting

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 12 publications
0
2
0
Order By: Relevance
“…With the recent advent of deep learning-based KWS methods, a standard solution for example model architectures is to normalize input images to fixed size [69,104,121,166]. For instance, Wei and co-workers propose a normalization by resizing all input images to a standard size of 310 pixel width and 50 height in [121], whereas in [120], they resize all input images so that they have the same width (either pure or by padding white pixels) and aspect ratio. Wicht et al [150,151] normalize the word images to remove the skew and slant of the text using [228].…”
Section: Normalizationmentioning
confidence: 99%
See 1 more Smart Citation
“…With the recent advent of deep learning-based KWS methods, a standard solution for example model architectures is to normalize input images to fixed size [69,104,121,166]. For instance, Wei and co-workers propose a normalization by resizing all input images to a standard size of 310 pixel width and 50 height in [121], whereas in [120], they resize all input images so that they have the same width (either pure or by padding white pixels) and aspect ratio. Wicht et al [150,151] normalize the word images to remove the skew and slant of the text using [228].…”
Section: Normalizationmentioning
confidence: 99%
“…Even more promising results have been introduced by the recent bloom of adversarial learning and Generative Adversarial Networks (GANs) [319] which basically employ generative modeling to augment data in limited datasets. In our proposed method, we use a similar strategy to render KWS models such as the PHOCNet, robust [32,34,121,150,180,235]. In several computer vision problems, deep features have often led to superior performance over the standard use of the employed network.…”
Section: Adversarial Deep Feature Adaptationmentioning
confidence: 99%