The sheer volumes of data generated by Earth observation and remote sensing technologies continue to make a major impact, propelling key geospatial applications into an era that is both data and compute intensive. This rapid advancement poses new computational and data processing challenges. We implement a novel remote sensing data flow (RESFlow) for advanced machine learning and computing with massive amounts of remotely sensed imagery. The core contribution is partitioning massive amounts of data based on their spectral and semantic characteristics for distributed imagery analysis. RESFlow takes advantage of both a unified analytics engine for large-scale data processing and the availability of modern computing hardware to accelerate deep learning inference on expansive remote sensing imagery. The framework incorporates a strategy to optimize resource utilization across multiple executors assigned to a single worker. We showcase its deployment on compute- and data-intensive pixel-level labeling workloads. The pipeline invokes deep learning inference at three stages: deep feature extraction, deep metric mapping, and deep semantic segmentation. These tasks impose compute-intensive and GPU resource-sharing challenges, motivating a parallelized pipeline for all execution steps. By taking advantage of Apache Spark and NVIDIA DGX-1 and DGX-2 computing platforms, we demonstrate unprecedented compute speed-ups for deep learning inference on pixel-labeling workloads, processing 21,028 terabytes of imagery data and delivering output maps at an area rate of 5.245 sq. km/sec, amounting to 453,168 sq. km/day and reducing a 28-day workload to 21 hours.
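The partition-then-infer pattern described above can be illustrated with a minimal, self-contained sketch. This is not RESFlow's actual implementation or API: the tile records, the `spectral_key` quantization rule, and the `run_inference` stand-in are all hypothetical, and in the real pipeline the per-bucket map would be dispatched through Apache Spark executors rather than a local loop.

```python
from statistics import mean

def spectral_key(tile, n_buckets=4):
    """Assign a tile to a bucket by quantizing its mean pixel intensity
    (a toy proxy for the spectral characteristics used for partitioning)."""
    m = mean(tile["pixels"])
    return min(int(m / 256 * n_buckets), n_buckets - 1)

def partition(tiles, n_buckets=4):
    """Group tiles into spectral buckets for distributed analysis."""
    buckets = {b: [] for b in range(n_buckets)}
    for tile in tiles:
        buckets[spectral_key(tile, n_buckets)].append(tile)
    return buckets

def run_inference(tile):
    """Stand-in for per-tile deep learning inference (feature extraction,
    metric mapping, and semantic segmentation in the real pipeline)."""
    return {"id": tile["id"],
            "label": "water" if mean(tile["pixels"]) < 64 else "land"}

tiles = [
    {"id": 0, "pixels": [10, 20, 30]},     # dark tile -> low-intensity bucket
    {"id": 1, "pixels": [200, 220, 240]},  # bright tile -> high-intensity bucket
]
buckets = partition(tiles)
# In RESFlow this map over buckets would run in parallel across executors.
results = [run_inference(t) for b in buckets.values() for t in b]
```

The point of the sketch is the structure: partitioning happens once, up front, by data characteristics, so each bucket's inference workload is independent and trivially parallelizable.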
As satellite imagery collections continue to grow at an astonishing rate, so does the demand for automated and scalable object detection and segmentation. Scaling computational activities demands models that generalize well across the various challenges that can hamper progress, including 1) diverse imaging and geographic conditions, 2) sampling bias in training data, 3) manual ground truth collection, 4) tooling for model reuse and accountability assessment, and 5) poor model training strategies. A great deal of progress has been made on these challenges. We contribute through further development of ReSFlow, a workflow that breaks the problem of model generalization into a collection of specialized exploitations. ReSFlow partitions imagery collections into homogeneous buckets, each equipped with exploitation models trained to perform well under that bucket's specific context. Essentially, ReSFlow aims for generalization through stratification. Within a bucket, exploitation is therefore a homogeneous process that mitigates heterogeneity challenges, including limited training data and the data biases that can occur over varied conditions. Furthermore, custom model architectures and rich training strategies effective for within-bucket conditions can be developed. Meanwhile, across-bucket performance metrics support systematic views of the workflow, leading to optimal data-labeling allocations and indications that further specialization (and retooling of models) is warranted. Herein, we discuss the formation of models during the framework's "Offline Initialization" stage. Lastly, we exploit the inherent parallelism due to bucketing to introduce model reuse and demonstrate efficacy by reducing an 89-day manual data-labeling cost to zero days in a new area of interest.
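The generalization-through-stratification idea can be sketched as a routing table from bucket keys to bucket-specialized models. Everything here is an illustrative assumption, not the ReSFlow implementation: the bucket key (sensor and season), the `ThresholdModel` stand-in, and the specific thresholds are invented for the sketch.

```python
def bucket_of(image_meta):
    """Route an image to its homogeneous bucket by acquisition context
    (sensor and season here are illustrative keys, not ReSFlow's)."""
    return (image_meta["sensor"], image_meta["season"])

class ThresholdModel:
    """Toy stand-in for a bucket-specialized exploitation model."""
    def __init__(self, threshold):
        self.threshold = threshold

    def predict(self, pixel):
        return 1 if pixel >= self.threshold else 0

# One specialized model per bucket; each is trained (here, hand-set)
# for its bucket's homogeneous conditions.
models = {
    ("S2", "summer"): ThresholdModel(threshold=128),
    ("S2", "winter"): ThresholdModel(threshold=90),  # e.g. snow shifts the boundary
}

def exploit(image_meta, pixel):
    """Within-bucket exploitation: look up the bucket's model and predict."""
    return models[bucket_of(image_meta)].predict(pixel)
```

Because buckets are independent, this routing structure is also what makes model reuse natural: a new area of interest whose imagery falls into an existing bucket can use that bucket's model directly, without new labeling.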
In this talk, Jonathan aims to frame the current challenges of explainability and understanding in ML-driven approaches to image processing, and their potential solution through explicit inference techniques. Jonathan works as a senior data scientist at EXPLORE-AI and is a member of the RAIL lab at the University of the Witwatersrand, South Africa. His research focuses on capturing the higher-order reasoning behind diagnostic outcomes in radiological investigations using agent-based methods, allowing learning models to better understand and articulate knowledge in a clinical setting.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations: citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.