Alex Ufkes scite author profile

Abstract-This paper presents a new state-of-the-art for document image classification and retrieval, using features learned by deep convolutional neural networks (CNNs). In object and scene analysis, deep neural nets are capable of learning a hierarchical chain of abstraction from pixel inputs to concise and descriptive representations. The current work explores this capacity in the realm of document analysis, and confirms that this representation strategy is superior to a variety of popular hand-crafted alternatives. Experiments also show that (i) features extracted from CNNs are robust to compression, (ii) CNNs trained on non-document images transfer well to document analysis tasks, and (iii) enforcing region-specific feature-learning is unnecessary given sufficient training data. This work also makes available a new labelled subset of the IIT-CDIP collection, containing 400,000 document images across 16 categories, useful for training new CNNs for document analysis.

show abstract

Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval

Harley¹,

Ufkes²,

Derpanis³

2015

Preprint

View full text Add to dashboard Cite

Initial experiments on 3D modeling of complex disaster environments using unmanned aerial vehicles

Ferworn

Tran

Ufkes

et al. 2011

View full text Add to dashboard Cite

A Markerless Augmented Reality System for Mobile Devices

Ufkes

Fiala

2013

View full text Add to dashboard Cite

Visual Odometry Using 3-Dimensional Video Input

Fiala

Ufkes

2011

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Alex Ufkes

Evaluation of deep convolutional nets for document image classification and retrieval

Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval

Initial experiments on 3D modeling of complex disaster environments using unmanned aerial vehicles

A Markerless Augmented Reality System for Mobile Devices

Visual Odometry Using 3-Dimensional Video Input

Contact Info

Product

Resources

About