2020
DOI: 10.48550/arxiv.2006.05103
Preprint

The Curious Case of Convex Neural Networks

Abstract: In this paper, we investigate a constrained formulation of neural networks where the output is a convex function of the input. We show that the convexity constraints can be enforced on both fully connected and convolutional layers, making them applicable to most architectures. The convexity constraints include restricting the weights (for all but the first layer) to be non-negative and using a non-decreasing convex activation function. Albeit simple, these constraints have profound implications on the generali…
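To make the constraint concrete, below is a minimal PyTorch sketch, not the paper's implementation: layer sizes, names, and the softplus reparameterisation are assumptions (the paper may enforce non-negativity differently, e.g. by clipping or projection). The weights of every layer after the first are kept non-negative and the activation (ReLU) is convex and non-decreasing, so each output coordinate is convex in the input.

import torch
import torch.nn as nn
import torch.nn.functional as F

class ConvexMLP(nn.Module):
    """Hedged sketch of the constrained formulation described in the abstract:
    non-negative weights for all but the first layer, plus a convex,
    non-decreasing activation, make the output convex in the input."""

    def __init__(self, in_dim, hidden_dim, out_dim):
        super().__init__()
        self.first = nn.Linear(in_dim, hidden_dim)    # first layer is unconstrained
        self.hidden = nn.Linear(hidden_dim, hidden_dim)
        self.out = nn.Linear(hidden_dim, out_dim)

    def forward(self, x):
        # softplus of the raw weights keeps the later layers' weights non-negative
        z = F.relu(self.first(x))
        z = F.relu(F.linear(z, F.softplus(self.hidden.weight), self.hidden.bias))
        return F.linear(z, F.softplus(self.out.weight), self.out.bias)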

Cited by 2 publications (3 citation statements)
References 15 publications
“…Proposed recently in [62], these networks have attracted growing interest thanks to their distinctive properties. They have found applications in classification problems [63], in new control strategies [64], and in approximating the space of convex functions [65]. For a given input x ∈ R^d, the output z_L of a network with L layers of D hidden neurons is defined recursively by the following expression for i ∈ [0, 1, ..., L]:…”
Section: Minimax Formulation of the Dual Monge-Kantorovich Problem
confidence: 99%
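The recursive expression itself is cut off in the quote. As a hedged illustration only, a commonly used input-convex recursion (in the spirit of ICNNs; the exact form used by the citing paper may differ) is sketched below, with non-negative weights on the previous layer's activations and unconstrained pass-through weights on the raw input.

import numpy as np

def icnn_forward(x, Wz, Wx, b, act=lambda t: np.maximum(t, 0.0)):
    """Hedged sketch of a standard input-convex recursion (assumed, not quoted):
    z_{i+1} = act(Wz_i @ z_i + Wx_i @ x + b_i), with the Wz_i non-negative
    and act convex and non-decreasing."""
    z = act(Wx[0] @ x + b[0])                              # first layer: no z-term yet
    for i in range(1, len(Wx)):
        z = act(np.abs(Wz[i - 1]) @ z + Wx[i] @ x + b[i])  # np.abs enforces non-negative Wz
    return z

# Tiny usage with d = 2 inputs, D = 4 hidden neurons and L = 3 layers (shapes illustrative).
d, D, L = 2, 4, 3
rng = np.random.default_rng(0)
Wx = [rng.normal(size=(D, d)) for _ in range(L)]
Wz = [rng.normal(size=(D, D)) for _ in range(L - 1)]
b = [rng.normal(size=D) for _ in range(L)]
print(icnn_forward(rng.normal(size=d), Wz, Wx, b))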
“…In another work, [16] describe an ICNN architecture for binary classification, where the output is a two-dimensional vector. Each element of this vector is a convex function of the inputs.…”
Section: CDiNN-Convex Difference Neural Network
confidence: 99%
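As a hedged illustration of that idea (reusing the ConvexMLP sketch defined after the abstract above, which is not [16]'s exact architecture, with an assumed input size of 784), a two-output convex head can be used for binary classification: each logit is convex in the input, and the class decision compares the two convex surfaces, so the boundary is where their difference changes sign.

import torch

model = ConvexMLP(in_dim=784, hidden_dim=128, out_dim=2)
x = torch.randn(4, 784)
logits = model(x)              # logits[:, 0] and logits[:, 1] are each convex in x
pred = logits.argmax(dim=1)    # binary prediction compares the two convex outputs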
“…In particular, when biases are ignored, even the identity mapping cannot be learnt by an ICNN without pass-through layers. One approach to address this problem is suggested in [16], where it is proposed to use Leaky ReLU or ELU as the activation function. The ELU activation is given by output = α·(e^x − 1) if x < 0, and x if x ≥ 0, and Leaky ReLU is given by output = α·x if x < 0, and x if x ≥ 0, where x is the input and α is a hyperparameter.…”
Section: Pass-through Layers
confidence: 99%
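For reference, a minimal NumPy sketch of the two activations named in the quote, using their standard definitions with α as the hyperparameter: for 0 < α ≤ 1 both are convex and non-decreasing, yet unlike plain ReLU they do not flatten negative pre-activations to zero, which is what lets a weight-constrained network still pass negative values through.

import numpy as np

def elu(x, alpha=1.0):
    # ELU: alpha * (exp(x) - 1) if x < 0, else x
    return np.where(x < 0, alpha * (np.exp(x) - 1.0), x)

def leaky_relu(x, alpha=0.01):
    # Leaky ReLU: alpha * x if x < 0, else x
    return np.where(x < 0, alpha * x, x)

x = np.linspace(-2.0, 2.0, 5)
print(elu(x))         # negative inputs decay smoothly towards -alpha instead of 0
print(leaky_relu(x))  # negative inputs are scaled by alpha instead of zeroed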