Max Ehrlich scite author profile

We introduce a general method of performing Residual Network inference and learning in the JPEG transform domain that allows the network to consume compressed images as input. Our formulation leverages the linearity of the JPEG transform to redefine convolution and batch normalization with a tune-able numerical approximation for ReLu. The result is mathematically equivalent to the spatial domain network up to the ReLu approximation accuracy. A formulation for image classification and a model conversion algorithm for spatial domain networks are given as examples of the method. We show skipping the costly decompression step allows for faster processing of images with little to no penalty in the network accuracy.1. The general method for expressing convolutional networks in the JPEG domain 2. Concrete formulation for residual blocks to perform classification 3. A model conversion algorithm to apply pretrained spatial domain networks to JPEG images 4. Approximated Spatial Masking: the first general technique for application of piecewise linear functions in the transform domain By skipping the decompression step and by operating on the compressed format, we show a notable increase in speed for testing and a marginal speed for training. Prior WorkWe review prior work separated into three categories: compressed domain operations, machine learning in the compressed domain, and deep learning in the compressed domain. Compressed Domain OperationsThe expression of common operations in the compressed domain was an extremely active area of study in the late 80s and early 90s, motivated by the lack of computing power to 1

show abstract

Stacked U-Nets for Ground Material Segmentation in Remote Sensing Imagery

Ghosh

Ehrlich

Shah

et al. 2018

View full text Add to dashboard Cite

Analyzing and Mitigating JPEG Compression Defects in Deep Learning

Ehrlich

Davis

Lim

et al. 2021

View full text Add to dashboard Cite

Deep Residual Learning in the JPEG Transform Domain

Ehrlich¹,

Davis²

2018

Preprint

View full text Add to dashboard Cite

A Frequency Perspective of Adversarial Robustness

Maiya¹,

Ehrlich²,

Agarwal³

et al. 2021

Preprint

View full text Add to dashboard Cite

Adversarial examples pose a unique challenge for deep learning systems. Despite recent advances in both attacks and defenses, there is still a lack of clarity and consensus in the community about the true nature and underlying properties of adversarial examples. A deep understanding of these examples can provide new insights towards the development of more effective attacks and defenses. Driven by the common misconception that adversarial examples are high-frequency noise, we present a frequency-based understanding of adversarial examples, supported by theoretical and empirical findings. Our analysis shows that adversarial examples are neither in high-frequency nor in low-frequency components, but are simply dataset dependent. Particularly, we highlight the glaring disparities between models trained on CIFAR-10 and ImageNet-derived datasets. Utilizing this framework, we analyze many intriguing properties of training robust models with frequency constraints, and propose a frequency-based explanation for the commonly observed accuracy vs. robustness trade-off.

show abstract

Unsupervised Super-Resolution of Satellite Imagery for High Fidelity Material Label Transfer

Ghosh

Ehrlich

Davis

et al. 2019

View full text Add to dashboard Cite

Urban material recognition in remote sensing imagery is a highly relevant, yet extremely challenging problem due to the difficulty of obtaining human annotations, especially on low resolution satellite images. To this end, we propose an unsupervised domain adaptation based approach using adversarial learning. We aim to harvest information from smaller quantities of high resolution data (source domain) and utilize the same to super-resolve low resolution imagery (target domain). This can potentially aid in semantic as well as material label transfer from a richly annotated source to a target domain.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Max Ehrlich

Facial Attributes Classification Using Multi-task Representation Learning

Quantization Guided JPEG Artifact Correction

Deep Residual Learning in the JPEG Transform Domain

Stacked U-Nets for Ground Material Segmentation in Remote Sensing Imagery

Analyzing and Mitigating JPEG Compression Defects in Deep Learning

Deep Residual Learning in the JPEG Transform Domain

A Frequency Perspective of Adversarial Robustness

Unsupervised Super-Resolution of Satellite Imagery for High Fidelity Material Label Transfer

Contact Info

Product

Resources

About