Suvojit Manna scite author profile

Achieving better recognition rate for text in video action images is challenging due to multi-type texts with unpredictable backgrounds. We propose a new method for the classification of captions (which is edited text) and scene texts (which is part of an image in video images of Yoga, Concert, Teleshopping, Craft, and Recipe classes). The proposed method introduces a new fusion criterion-based on DCT and Fourier coefficients to extract features that represent good clarity and visibility of captions to separate them from scene texts. The variances for coefficients of corresponding pixels of DCT and Fourier images are computed to derive the respective weights. The weights and coefficients are further used to generate a fused image. Furthermore, the proposed method estimates sparsity in Canny edge image of each fused image to derive rules for classifying caption and scene texts. Lastly, the proposed method is evaluated on images of five above-mentioned action image classes to validate the derived rules. Comparative studies with the state-of-the-art methods on the standard databases show that the proposed method outperforms the existing methods in terms of classification. The recognition experiments before and after classification show that the recognition performance rate improves significantly after classification.

show abstract

Prediction of Diabetes Type-II Using a Two-Class Neural Network

Rakshit

Manna

Biswas

et al. 2017

View full text Add to dashboard Cite

Identifying Research Index (R⁺) by Efficient Ranking and Recommender System for Evaluation and Analysis of Trending Research

Debnath¹,

Manna²,

Datta³

et al. 2018

View full text Add to dashboard Cite

Smart bag tracking and alert system using RFID

Sarkar¹,

Manna²,

Datta

2017

View full text Add to dashboard Cite

LiSHT: Non-parametric Linearly Scaled Hyperbolic Tangent Activation Function for Neural Networks

Roy¹,

Manna²,

Dubey

et al. 2023

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

334 Leonard St

Brooklyn, NY 11211

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Suvojit Manna

Attention-Based Adaptive Spectral–Spatial Kernel ResNet for Hyperspectral Image Classification

A statistical approach to predict flight delay using gradient boosted decision tree

LiSHT: Non-Parametric Linearly Scaled Hyperbolic Tangent Activation Function for Neural Networks

A New DCT-FFT Fusion Based Method for Caption and Scene Text Classification in Action Video Images

Prediction of Diabetes Type-II Using a Two-Class Neural Network

Identifying Research Index (R⁺) by Efficient Ranking and Recommender System for Evaluation and Analysis of Trending Research

Smart bag tracking and alert system using RFID

LiSHT: Non-parametric Linearly Scaled Hyperbolic Tangent Activation Function for Neural Networks

Contact Info

Product

Resources

About

Suvojit Manna

Attention-Based Adaptive Spectral–Spatial Kernel ResNet for Hyperspectral Image Classification

A statistical approach to predict flight delay using gradient boosted decision tree

LiSHT: Non-Parametric Linearly Scaled Hyperbolic Tangent Activation Function for Neural Networks

A New DCT-FFT Fusion Based Method for Caption and Scene Text Classification in Action Video Images

Prediction of Diabetes Type-II Using a Two-Class Neural Network

Identifying Research Index (R+) by Efficient Ranking and Recommender System for Evaluation and Analysis of Trending Research

Smart bag tracking and alert system using RFID

LiSHT: Non-parametric Linearly Scaled Hyperbolic Tangent Activation Function for Neural Networks

Contact Info

Product

Resources

About

Identifying Research Index (R⁺) by Efficient Ranking and Recommender System for Evaluation and Analysis of Trending Research