2021 | Preprint | DOI: 10.48550/arxiv.2103.15419

Translating Numerical Concepts for PDEs into Neural Architectures

Abstract: We investigate what can be learned from translating numerical algorithms into neural networks. On the numerical side, we consider explicit, accelerated explicit, and implicit schemes for a general higher order nonlinear diffusion equation in 1D, as well as linear multigrid methods. On the neural network side, we identify corresponding concepts in terms of residual networks (ResNets), recurrent networks, and U-nets. These connections guarantee Euclidean stability of specific ResNets with a transposed convolutio…
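To illustrate the translation the abstract describes, here is a minimal NumPy sketch (an illustration under assumed choices, not the paper's code): one explicit step of a second order nonlinear diffusion equation, written so that its structure (convolution, pointwise nonlinearity, transposed convolution, skip connection) is that of a ResNet block. The Perona-Malik diffusivity and the periodic boundaries are illustrative assumptions; the paper treats a more general higher order equation.

import numpy as np

def forward_diff(u, h=1.0):
    # Convolution with kernel [-1, 1]/h (forward difference).
    # Periodic boundaries via np.roll are an illustrative choice.
    return (np.roll(u, -1) - u) / h

def transposed_diff(v, h=1.0):
    # The transposed (adjoint) convolution of forward_diff.
    return (np.roll(v, 1) - v) / h

def diffusivity(s2, lam=1.0):
    # Perona-Malik diffusivity g(s^2) = 1 / (1 + s^2/lam^2),
    # acting as a pointwise nonlinearity.
    return 1.0 / (1.0 + s2 / lam**2)

def explicit_step(u, tau=0.4, h=1.0, lam=1.0):
    # One explicit step of du/dt = d/dx( g((u_x)^2) u_x ):
    #   u_{k+1} = u_k - tau * D^T( g(|D u_k|^2) * D u_k ).
    # Read as a network: a skip connection (u) plus a residual branch
    # built from a convolution (D), a pointwise nonlinearity (g), and
    # the transposed convolution (D^T), i.e. a ResNet block.
    du = forward_diff(u, h)
    return u - tau * transposed_diff(diffusivity(du**2, lam) * du, h)

Stacking such blocks corresponds to iterating the scheme; for this model case with diffusivities bounded by 1, the classical time step bound tau <= h^2/2 makes each step nonexpansive in the Euclidean norm, which is the kind of stability guarantee the abstract refers to.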

Cited by 4 publications (10 citation statements) | References 23 publications
“…This approximation is plugged into the PDE, after which we invoke Galerkin's method. We multiply the PDE with a test function and reduce the differentiability requirement on u_h using integration by parts: ∫…”
Section: FEM Loss
confidence: 99%
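
To make that step concrete, consider the 1D Poisson model problem -u'' = f with zero boundary values (an illustrative choice, not necessarily the citing paper's PDE). Multiplying by a test function v that vanishes at the boundary and integrating by parts moves one derivative off the approximation u_h:

\int_0^1 u_h'(x)\, v'(x)\, dx \;=\; \int_0^1 f(x)\, v(x)\, dx \qquad \text{for all admissible test functions } v,

so u_h only needs one weak derivative instead of two, which is what admits piecewise linear finite element approximations.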
“…We note that there is a marginal improvement in the loss at the same time; we show that there is a 3× improvement in training time for a very deep U-Net architecture. This ties into the theme of correlations between U-Net architecture and multigrid methods mentioned in Alt et al. [1]. With both the architectural adaptation and the Half-V cycle, in a 2D spatial domain with a resolution of 512 × 512, we get a speedup of 3× over the baseline training approach at full resolution.…”
Section: Architectural Adaptation
confidence: 53%
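
To make the U-Net/multigrid correspondence these statements lean on concrete, here is a minimal NumPy sketch of one two-grid cycle for the 1D Poisson problem -u'' = f with zero Dirichlet boundaries, with comments marking the structural analogy drawn in Alt et al. [1]: restriction as downsampling, coarse solve as bottleneck, prolongation plus correction as upsampling with a skip connection. The smoother, problem, and grid sizes are illustrative assumptions, not code from either paper.

import numpy as np

def smooth(u, f, h, iters=2, omega=0.8):
    # Damped Jacobi relaxation for -u'' = f; in the network reading this is
    # the role played by convolutional layers acting at a fixed resolution.
    for _ in range(iters):
        un = u.copy()
        un[1:-1] = (1 - omega) * u[1:-1] + omega * 0.5 * (u[:-2] + u[2:] + h**2 * f[1:-1])
        u = un
    return u

def two_grid_cycle(u, f, h):
    # One two-grid cycle for -u'' = f with zero Dirichlet boundaries on a
    # grid of n intervals (n even, n >= 4). Comments mark the U-Net analogy.
    n = u.size - 1

    u = smooth(u, f, h)                 # pre-smoothing ~ fine-level convolutions

    r = np.zeros_like(u)                # residual on the fine grid
    r[1:-1] = f[1:-1] - (2 * u[1:-1] - u[:-2] - u[2:]) / h**2

    N = n // 2                          # full-weighting restriction ~ encoder downsampling
    rc = np.zeros(N + 1)
    rc[1:-1] = 0.25 * (r[1:-2:2] + 2 * r[2:-1:2] + r[3::2])

    # Coarse solve ~ the U-Net bottleneck (here: a direct tridiagonal solve).
    A = (np.diag(2 * np.ones(N - 1)) - np.diag(np.ones(N - 2), 1)
         - np.diag(np.ones(N - 2), -1)) / (2 * h)**2
    ec = np.zeros(N + 1)
    ec[1:-1] = np.linalg.solve(A, rc[1:-1])

    e = np.zeros_like(u)                # linear prolongation ~ decoder upsampling
    e[::2] = ec
    e[1::2] = 0.5 * (ec[:-1] + ec[1:])

    u = u + e                           # coarse-grid correction ~ skip connection

    return smooth(u, f, h)              # post-smoothing ~ decoder convolutions

Applying the same construction recursively at the coarse level turns the two-grid cycle into a V-cycle, whose resolution hierarchy is exactly the encoder-decoder shape of a U-Net.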