Training deep neural networks (DNNs) in large-cluster computing environments is increasingly necessary as networks grow in size and complexity. Local memory and processing limitations require robust data and model parallelism to cross compute-node boundaries. We propose a linear-algebraic approach to model parallelism in deep learning that allows parallel distribution of any tensor in a DNN. Rather than rely on automatic differentiation tools, which do not universally support distributed-memory parallelism, we show that parallel data movement operations, e.g., broadcast, sum-reduce, and halo exchange, are linear operators, and by defining the relevant spaces and inner products, we manually derive the adjoint, or backward, operators required for gradient-based training of DNNs. We build distributed DNN layers from these parallel primitives, composed with sequential layer implementations, and demonstrate their application by building and training a distributed DNN using DistDL, a PyTorch- and MPI-based distributed deep learning toolkit.
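To make the adjoint relationship concrete, the sketch below pairs an MPI broadcast with its adjoint, a sum-reduce, inside a custom PyTorch autograd function. This is a minimal, CPU-only illustration using mpi4py, not DistDL's actual interface; the class name and the use of the global communicator are our assumptions.

    import torch
    from mpi4py import MPI

    comm = MPI.COMM_WORLD

    class Broadcast(torch.autograd.Function):
        # Forward: the linear operator B copies a tensor from rank 0 to all ranks.
        # Backward: its adjoint B* sum-reduces the incoming gradients back onto
        # rank 0, which is exactly the operator gradient-based training requires.
        @staticmethod
        def forward(ctx, x):
            y = x.detach().clone().contiguous()
            comm.Bcast(y.numpy(), root=0)  # all ranks now hold rank 0's data
            return y

        @staticmethod
        def backward(ctx, grad_output):
            g = grad_output.detach().clone().contiguous()
            g_sum = torch.zeros_like(g)
            comm.Reduce(g.numpy(), g_sum.numpy(), op=MPI.SUM, root=0)
            return g_sum  # sum of gradients on rank 0, zeros elsewhere

Checking that <y, Bx> = <B*y, x> under the relevant inner products is the correctness test this construction rests on; the same pattern extends to sum-reduce (whose adjoint is a broadcast) and to halo exchange.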
We present the Seismic Laboratory for Imaging and Modeling/Monitoring open-source software framework for computational geophysics and, more generally, inverse problems involving the wave equation (e.g., seismic and medical ultrasound), regularization with learned priors, and learned neural surrogates for multiphase flow simulations. By integrating multiple layers of abstraction, the software is designed to be both readable and scalable, allowing researchers to formulate problems abstractly while exploiting the latest developments in high-performance computing. The design principles and their benefits are illustrated and demonstrated by building a scalable prototype for permeability inversion from time-lapse crosswell seismic data, a problem that couples wave physics with multiphase flow and involves machine learning.
Fourier neural operators (FNOs) are a recently introduced neural network architecture for learning solution operators of partial differential equations (PDEs) and have been shown to perform significantly better than comparable approaches based on convolutional networks. Once trained, FNOs can achieve speed-ups of multiple orders of magnitude over conventional numerical PDE solvers. However, due to the high dimensionality of their input data and network weights, FNOs have so far only been applied to two-dimensional or small three-dimensional problems. To lift this problem-size barrier, we propose a model-parallel version of FNOs based on domain decomposition of both the input data and the network weights. We demonstrate that our model-parallel FNO can predict time-varying PDE solutions with over 3.2 billion variables on Summit using up to 768 GPUs, and we show an example of training a distributed FNO on the Azure cloud to simulate multiphase CO2 dynamics in the Earth's subsurface.
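The memory pressure that motivates the model-parallel design is already visible in a serial FNO layer: the learned weights are dense complex tensors over the retained Fourier modes, so both activations (full 3D FFTs) and weights grow quickly with problem size. The sketch below is a minimal serial spectral convolution in PyTorch, simplified to keep only the lowest-frequency corner of modes; it illustrates the layer being decomposed, not the paper's distributed implementation.

    import torch

    class SpectralConv3d(torch.nn.Module):
        # One FNO spectral-convolution block: forward FFT, per-mode linear
        # mixing of channels on a truncated set of Fourier modes, inverse FFT.
        def __init__(self, in_ch, out_ch, m1, m2, m3):
            super().__init__()
            self.modes = (m1, m2, m3)
            scale = 1.0 / (in_ch * out_ch)
            # Complex weights over retained modes: this tensor is what a
            # model-parallel FNO shards across ranks alongside the input data.
            self.weight = torch.nn.Parameter(
                scale * torch.randn(in_ch, out_ch, m1, m2, m3, dtype=torch.cfloat))

        def forward(self, x):
            # x: (batch, channels, nx, ny, nz), real-valued
            x_ft = torch.fft.rfftn(x, dim=(-3, -2, -1))
            m1, m2, m3 = self.modes
            out_ft = torch.zeros(x.shape[0], self.weight.shape[1], *x_ft.shape[-3:],
                                 dtype=torch.cfloat, device=x.device)
            out_ft[..., :m1, :m2, :m3] = torch.einsum(
                "bixyz,ioxyz->boxyz", x_ft[..., :m1, :m2, :m3], self.weight)
            return torch.fft.irfftn(out_ft, s=x.shape[-3:], dim=(-3, -2, -1))

In the distributed version described in the abstract, the spatial axes of x are partitioned across ranks, the FFT becomes a sequence of local transforms and global transposes, and the weight tensor is likewise sharded, so no single GPU ever holds the full problem.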