2017
DOI: 10.1088/1361-6420/aa9a90
Stable architectures for deep neural networks

Abstract: Deep neural networks have become invaluable tools for supervised machine learning, e.g., classification of text or images. While often offering superior results over traditional techniques and successfully expressing complicated patterns in data, deep architectures are known to be challenging to design and train such that they generalize well to new data. Critical issues with deep architectures are numerical instabilities in derivative-based learning algorithms commonly called exploding or vanishing gradients.…
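To make the exploding/vanishing-gradient issue named in the abstract concrete, here is a minimal, self-contained sketch (not taken from the paper): it backpropagates a gradient through many identical linear layers and shows how its norm grows or decays geometrically depending on the scale of the layer weights. The layer width, depth, and scaling factors are arbitrary illustrative choices.

```python
import numpy as np

# Minimal illustration (not from the paper): backpropagating through a deep
# stack of identical linear layers multiplies the gradient by the same
# Jacobian at every step, so its norm grows or shrinks geometrically.
rng = np.random.default_rng(0)
width, depth = 64, 100

def gradient_norm_after(depth, scale):
    # Random weight matrix with entries of standard deviation scale / sqrt(width),
    # so its spectral radius is roughly `scale`.
    W = scale * rng.standard_normal((width, width)) / np.sqrt(width)
    g = np.ones(width)               # gradient arriving at the last layer
    for _ in range(depth):
        g = W.T @ g                  # chain rule through one linear layer
    return np.linalg.norm(g)

for scale in (0.8, 1.0, 1.2):
    print(f"scale={scale:.1f}  ||grad|| after {depth} layers: "
          f"{gradient_norm_after(depth, scale):.3e}")
```

With weights scaled slightly below one the gradient norm collapses toward zero (vanishing), and slightly above one it blows up (exploding), which is the instability the paper's stable architectures are designed to avoid.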
Cited by 494 publications (597 citation statements).
References 47 publications.
“…For image deblurring, images computed with too small values of T remain blurry, while for T > T̄ ringing artifacts are generated and their intensity increases with larger T. For a corrupted image, the associated adjoint state requires the knowledge of the ground truth for the terminal condition (14), which is in general not available. However, Figure 7 shows that the learned average optimal stopping time T̄ yields the smallest expected error.…”
Section: Results
Citation type: mentioning
Confidence: 99%
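The statement above is about learning a stopping time for an ODE-based image-restoration flow. As a loose, self-contained analogue (not the cited method or its deblurring setup), the sketch below runs an explicit heat-equation smoothing flow on noisy 1-D signals and picks the stopping index that minimizes the average reconstruction error over several samples, mimicking the idea that an average stopping time learned from data yields the smallest expected error. Grid size, noise level, and step size are made-up illustrative values.

```python
import numpy as np

rng = np.random.default_rng(1)
n, steps, h = 256, 400, 0.2          # grid size, time steps, step size (toy choices)

def heat_step(u, h):
    # Explicit Euler step of the 1-D heat equation u_t = u_xx (stable for h <= 0.5).
    return u + h * (np.roll(u, 1) - 2 * u + np.roll(u, -1))

# Average the reconstruction error over several noisy samples at every
# stopping index, then pick the index with the smallest expected error.
errors = np.zeros(steps + 1)
n_samples = 30
for _ in range(n_samples):
    x = np.linspace(0, 4 * np.pi, n)
    u_true = np.sign(np.sin(x))                      # piecewise-constant signal
    u = u_true + 0.4 * rng.standard_normal(n)        # noisy observation
    for k in range(steps + 1):
        errors[k] += np.linalg.norm(u - u_true) / n_samples
        u = heat_step(u, h)

k_best = int(np.argmin(errors))
print(f"average-optimal stopping index: {k_best}")
print(f"error at k=0: {errors[0]:.2f}, at k_best: {errors[k_best]:.2f}, "
      f"at k={steps}: {errors[steps]:.2f}")
```

Stopping too early leaves the noise in place and stopping too late over-smooths the signal, so the averaged error curve has an interior minimum, analogous to the learned stopping time T̄ in the quoted result.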
“…Deep neural networks are capable of approximating nonlinear dynamical systems, as shown in many studies 102,103,106,123. The general nonlinear dynamical system can be represented by an equation of the form…”
Section: Learning Framework
Citation type: mentioning
Confidence: 99%
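The quoted sentence breaks off before the equation itself, which is left elided above. As a hedged illustration of the generic form such statements usually refer to (the symbols y, θ, f, and h below are assumptions, not the citing paper's notation), a continuous-time state equation and the forward-Euler discretization that recovers a ResNet-style layer update can be written as:

```latex
% Assumed generic form, not the citing paper's exact equation:
\[
  \dot{y}(t) = f\bigl(y(t),\,\theta(t)\bigr), \qquad t \in (0, T], \qquad y(0) = y_0 .
\]
% Forward-Euler discretization with step size h gives a ResNet-style layer update:
\[
  y_{j+1} = y_j + h\, f\bigl(y_j,\,\theta_j\bigr), \qquad j = 0, \dots, N-1 .
\]
```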
“…The authors adopted the discretize-then-differentiate viewpoint on the parameter estimation problem and suggested symplectic numerical integration in order to achieve better stability. As mentioned above, our work contrasts in that inference is always exact during learning, unlike the more involved architecture of [HR17] where learning is based on approximate inference. Furthermore, in our case, symplectic numerical integration is a consequence of making the diagram of Figure 2.2 (page 8) commute.…”
Citation type: mentioning
Confidence: 99%
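The symplectic integration mentioned in this statement can be illustrated with a staggered (symplectic-Euler/Verlet-style) forward pass, one common choice for Hamiltonian-inspired residual networks. The sketch below is a generic toy, not the architecture of [HR17] or of the citing paper; the layer sizes, nonlinearity, weights, and step size are arbitrary assumptions.

```python
import numpy as np

rng = np.random.default_rng(2)
d, depth, h = 8, 50, 0.1            # feature dimension, layers, step size (toy)

# Per-layer weights and biases for a toy Hamiltonian-style network.
K = [rng.standard_normal((d, d)) / np.sqrt(d) for _ in range(depth)]
b = [np.zeros(d) for _ in range(depth)]

def sigma(x):
    return np.tanh(x)

def staggered_forward(y0, z0):
    # Alternating updates of the two half-states, mimicking the staggered
    # structure of symplectic integrators for separable systems; this helps
    # keep the norm of the forward propagation from growing uncontrollably.
    y, z = y0.copy(), z0.copy()
    for j in range(depth):
        z = z - h * sigma(K[j].T @ y + b[j])   # update z using the current y
        y = y + h * sigma(K[j] @ z + b[j])     # update y using the new z
    return y, z

y_out, z_out = staggered_forward(rng.standard_normal(d), np.zeros(d))
print("||y_out|| =", np.linalg.norm(y_out), " ||z_out|| =", np.linalg.norm(z_out))
```

The design point is that each layer advances one half of the state using the freshly updated other half, the same discretize-then-propagate structure the quoted passage contrasts with its own exact-inference approach.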