It is by now well known that practical deep supervised learning can roughly be cast as an optimal control problem for a specific discrete-time, nonlinear dynamical system called an artificial neural network. In this work, we consider the continuous-time formulation of the deep supervised learning problem and study its behavior as the final time horizon increases, which in the neural network setting can be interpreted as increasing the number of layers.

For the classical regularized empirical risk minimization problem, we show that, in long time, the optimal states approach the zero training error regime, while the optimal control parameters approach, on an appropriate scale, minimal-norm parameters whose corresponding states lie precisely in the zero training error regime. Seen from the large-layer perspective, this result provides an alternative theoretical underpinning to the notion that neural networks learn best in the overparametrized regime.

We also propose a learning problem consisting of minimizing a cost with a state tracking term, and establish the well-known turnpike property: the solutions of the learning problem over long time intervals consist of three pieces, the first and last of which are transient short-time arcs, while the middle piece is a long-time arc staying exponentially close to the optimal solution of an associated static learning problem. This property in fact yields a quantitative estimate for the number of layers required to reach the zero training error regime.

Both of the aforementioned asymptotic regimes are addressed in the context of continuous-time and continuous space-time neural networks, the latter taking the form of nonlinear integro-differential equations, hence covering residual neural networks with both fixed and possibly variable depths.

Contents
1. Introduction
2. A roadmap to continuous-time supervised learning
3. Asymptotics without tracking
4. Asymptotics with tracking
5. The zero training error regime

Date: August 7, 2020.
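To fix ideas, here is a minimal sketch of the continuous-time (neural ODE) formulation alluded to above; the notation for the data, loss, and readout map is illustrative and not the paper's exact setup. Given training pairs $\{(\vec{x}_i, \vec{y}_i)\}_{i=1}^{n}$, the states evolve according to

$$\dot{x}_i(t) = \sigma\big(w(t)\, x_i(t) + b(t)\big), \qquad t \in (0, T), \qquad x_i(0) = \vec{x}_i,$$

and the parameters $(w, b)$ are chosen to minimize a regularized empirical risk of the form

$$\inf_{(w, b)} \; \frac{1}{n} \sum_{i=1}^{n} \mathrm{loss}\big(P\, x_i(T), \vec{y}_i\big) \;+\; \int_0^T \big\|(w(t), b(t))\big\|^2 \, \mathrm{d}t,$$

where $\sigma$ is a Lipschitz nonlinearity and $P$ a fixed readout map. A forward Euler discretization of the ODE with step $T/N$ recovers an $N$-layer residual network, which is why letting $T \to \infty$ plays the role of letting the number of layers grow.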
In this paper, we study the evolution problem

$$\begin{cases} u_t(x,t) - \lambda_j\big(D^2 u(x,t)\big) = 0, & \text{in } \Omega \times (0,+\infty), \\ u(x,t) = g(x,t), & \text{on } \partial\Omega \times (0,+\infty), \\ u(x,0) = u_0(x), & \text{in } \Omega, \end{cases}$$

where $\Omega$ is a bounded domain in $\mathbb{R}^N$ (which verifies a suitable geometric condition on its boundary) and $\lambda_j(D^2 u)$ stands for the $j$th eigenvalue of the Hessian matrix $D^2 u$. We assume that $u_0$ and $g$ are continuous functions satisfying the compatibility condition $u_0(x) = g(x,0)$ for $x \in \partial\Omega$. We show that the (unique) solution to this problem exists in the viscosity sense and can be approximated by the value function of a two-player zero-sum game as the parameter measuring the size of the step taken in each round of the game goes to zero. In addition, when the boundary datum is independent of time, $g(x,t) = g(x)$, we show that viscosity solutions to this evolution problem stabilize and converge exponentially fast to the unique stationary solution as $t \to \infty$. For $j = 1$, the limit profile is just the convex envelope inside $\Omega$ of the boundary datum $g$, while for $j = N$, it is the concave envelope. We obtain this result with two different techniques: with partial differential equation (PDE) tools and with game-theoretical arguments. Moreover, in some special cases (for affine boundary data), we can show that solutions coincide with the stationary solution in finite time (which depends only on $\Omega$ and not on the initial condition $u_0$).
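As a point of reference (added here, not taken from the abstract): with the eigenvalues ordered $\lambda_1 \le \cdots \le \lambda_N$, the Courant–Fischer min-max theorem characterizes the operator as

$$\lambda_j\big(D^2 u(x)\big) = \min_{\substack{S \subset \mathbb{R}^N \\ \dim S = j}} \; \max_{\substack{v \in S \\ |v| = 1}} \big\langle D^2 u(x)\, v, v \big\rangle,$$

and the second-order Taylor identity

$$\big\langle D^2 u(x)\, v, v \big\rangle = \frac{u(x + \varepsilon v) + u(x - \varepsilon v) - 2\,u(x)}{\varepsilon^2} + o(1) \qquad (\varepsilon \to 0)$$

suggests the shape of the approximating game: in each round with step size $\varepsilon$, one player picks a $j$-dimensional subspace $S$, the other picks a unit direction $v \in S$, and the token moves to $x \pm \varepsilon v$ with equal probability. The precise payoff and timing conventions are those of the paper and are not reproduced here.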