2020
DOI: 10.1007/978-3-030-59830-3_7
|View full text |Cite
|
Sign up to set email alerts
|

A New DCT-FFT Fusion Based Method for Caption and Scene Text Classification in Action Video Images

Abstract: Achieving better recognition rate for text in video action images is challenging due to multi-type texts with unpredictable backgrounds. We propose a new method for the classification of captions (which is edited text) and scene texts (which is part of an image in video images of Yoga, Concert, Teleshopping, Craft, and Recipe classes). The proposed method introduces a new fusion criterion-based on DCT and Fourier coefficients to extract features that represent good clarity and visibility of captions to separat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(5 citation statements)
references
References 16 publications
(45 reference statements)
0
5
0
Order By: Relevance
“…However, the methods are not tested on action images without text information. Recently, the method [42] proposes the combination of Discrete Cosine Transform and Fast Fourier Transform for classifying caption and scene texts in action images to improve text recognition results. The method generates a fused image for the input and then the average of sparsity and non-sparsity counts in terms pixel values of zero or non-zeros is computed for classification.…”
Section: Related Workmentioning
confidence: 99%
See 4 more Smart Citations
“…However, the methods are not tested on action images without text information. Recently, the method [42] proposes the combination of Discrete Cosine Transform and Fast Fourier Transform for classifying caption and scene texts in action images to improve text recognition results. The method generates a fused image for the input and then the average of sparsity and non-sparsity counts in terms pixel values of zero or non-zeros is computed for classification.…”
Section: Related Workmentioning
confidence: 99%
“…6 Sample images of successful classification of the proposed modelon our dataset. Original Source: [42] https://doi.org/10.1007/s42452-021-04821-z of classes increases, the complexity of the problem also increases. But if we consider the overall performance in terms of classification rate, the proposed method outperforms the others.…”
Section: Concertmentioning
confidence: 99%
See 3 more Smart Citations