Ritesh Sarkhel scite author profile

Classifying heterogeneous visually rich documents is a challenging task. Difficulty of this task increases even more if the maximum allowed inference turnaround time is constrained by a threshold. The increased overhead in inference cost, compared to the limited gain in classification capabilities make current multi-scale approaches infeasible in such scenarios. There are two major contributions of this work. First, we propose a spatial pyramid model to extract highly discriminative multi-scale feature descriptors from a visually rich document by leveraging the inherent hierarchy of its layout. Second, we propose a deterministic routing scheme for accelerating end-to-end inference by utilizing the spatial pyramid model. A depth-wise separable multi-column convolutional network is developed to enable our method. We evaluated the proposed approach on four publicly available, benchmark datasets of visually rich documents. Results suggest that our proposed approach demonstrates robust performance compared to the state-of-the-art methods in both classification accuracy and total inference turnaround.

show abstract

Multiobjective optimization for recognition of isolated handwritten Indic scripts

Gupta¹,

Sarkhel

Das

et al. 2019

Pattern Recognition Letters

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Ritesh Sarkhel

A multi-objective approach towards cost effective isolated handwritten Bangla character and digit recognition

A multi-scale deep quad tree based feature extraction method for the recognition of isolated handwritten characters of popular indic scripts

An improved Harmony Search Algorithm embedded with a novel piecewise opposition based learning algorithm

Deterministic Routing between Layout Abstractions for Multi-Scale Classification of Visually Rich Documents

Multiobjective optimization for recognition of isolated handwritten Indic scripts

Contact Info

Product

Resources

About