Raymond Ptucha scite author profile

Convolutional Neural Networks (CNNs) have recently led to incredible breakthroughs on a variety of pattern recognition problems. Banks of finite impulse response filters are learned on a hierarchy of layers, each contributing more abstract information than the previous layer. The simplicity and elegance of the convolutional filtering process makes them perfect for structured problems such as image, video, or voice, where vertices are homogeneous in the sense of number, location, and strength of neighbors. The vast majority of classification problems, for example in the pharmaceutical, homeland security, and financial domains, are unstructured. As these problems are formulated into unstructured graphs, the heterogeneity of these problems, such as number of vertices, number of connections per vertex, and edge strength, cannot be tackled with standard convolutional techniques. We propose a novel neural learning framework that is capable of handling both homogeneous and heterogeneous data, while retaining the benefits of traditional CNN successes. Recently, researchers have proposed variations of CNNs that can handle graph data. In an effort to create learnable filter banks of graphs, these methods either induce constraints on the data or require preprocessing. As opposed to spectral methods, our framework, which we term Graph-CNNs, defines filters as polynomials of functions of the graph adjacency matrix. Graph-CNNs can handle both heterogeneous and homogeneous graph data, including graphs having entirely different vertex or edge sets. We perform experiments to validate the applicability of Graph-CNNs to a variety of structured and unstructured classification problems and demonstrate state-of-the-art results on document and molecule classification problems.
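
To make the phrase "filters as polynomials of functions of the graph adjacency matrix" concrete, here is a minimal NumPy sketch of one possible reading: a filter of the form H = c0·I + c1·A + c2·A², applied to a signal on the vertices. It is an illustration only, not the authors' implementation. The function name `polynomial_graph_filter`, the coefficient values, and the choice to use the raw adjacency matrix (rather than some function of it) are all assumptions made for this example; in a learned model the coefficients would be trainable parameters.

```python
import numpy as np

def polynomial_graph_filter(adjacency, signal, coeffs):
    """Apply H = sum_k coeffs[k] * A^k to a vertex signal (hypothetical helper)."""
    adjacency = np.asarray(adjacency, dtype=float)
    term = np.asarray(signal, dtype=float)  # starts as A^0 x = x
    output = np.zeros_like(term)
    for c in coeffs:
        output += c * term         # accumulate c_k * A^k x
        term = adjacency @ term    # advance to A^(k+1) x
    return output

# Assumed toy input: a 3-vertex path graph 0 - 1 - 2 with a unit signal on vertex 0.
A = np.array([[0, 1, 0],
              [1, 0, 1],
              [0, 1, 0]])
x = np.array([1.0, 0.0, 0.0])

# Illustrative coefficients; they are fixed here only to keep the example small.
y = polynomial_graph_filter(A, x, coeffs=[0.5, 0.25, 0.125])
print(y)
```

Because each power of A reaches one hop further, the degree of the polynomial sets how far the filter looks from each vertex, and nothing in the computation requires every vertex to have the same number of neighbors.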

Intelligent character recognition using fully convolutional neural networks

Ptucha

Such

Pillai

et al. 2019

Pattern Recognition

123

Recurrent Convolutional Structures for Audio Spoof and Video Deepfake Detection

Chintha

Thai

Sohrawardi

et al. 2020

IEEE J. Sel. Top. Signal Process.

108

Distracted Driver Detection: Deep Learning vs Handcrafted Features

Hssayeni

Saxena

Ptucha

et al. 2017

Manifold based Sparse Representation for robust expression recognition without neutral subtraction

Ptucha

Tsagkatakis

Savakis

2011

Semantic Text Summarization of Long Videos

Sah

Kulhare

Gray

et al. 2017

YOLOrs: Object Detection in Multimodal Remote Sensing Imagery

Sharma

Dhanaraj

Karnam

et al. 2021

IEEE J. Sel. Top. Appl. Earth Observations Remote Sensing

Deep-learning object detection methods that are designed for computer vision applications tend to under-perform when applied to remote sensing data. This is because, contrary to computer vision, in remote sensing training data are harder to collect and targets can be very small, occupying only a few pixels in the entire image, and exhibit arbitrary perspective transformations. Detection performance can improve by fusing data from multiple remote sensing modalities, including RGB, IR, hyper-spectral, multi-spectral, synthetic aperture radar, and LiDAR, to name a few. In this work, we propose YOLOrs: a new convolutional neural network, specifically designed for realtime object detection in multimodal remote sensing imagery. YOLOrs can detect objects at multiple scales, with smaller receptive fields to account for small targets, as well as predict target orientations. In addition, YOLOrs introduces a novel midlevel fusion architecture that renders it applicable to multimodal aerial imagery. Our experimental studies compare YOLOrs with contemporary alternatives and corroborate its merits.

General-Purpose Deep Point Cloud Feature Extractor

Domínguez

Dhamdhere

Petkar

et al. 2018

Robust Spatial Filtering With Graph Convolutional Neural Networks
