2020
DOI: 10.48550/arxiv.2006.06520
Preprint

Achieving robustness in classification using optimal transport with hinge regularization

Abstract: We propose a new framework for robust binary classification with deep neural networks, based on a hinge regularization of the Kantorovich-Rubinstein dual formulation for the estimation of the Wasserstein distance. The robustness of the approach is guaranteed by the strict Lipschitz constraint on the functions required by the optimization problem and by the direct interpretation of the loss in terms of adversarial robustness. We prove that this classification formulation has a solution, and is still the dual formulation …
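To make the hinge-regularized Kantorovich-Rubinstein objective concrete, here is a minimal PyTorch sketch of a loss with the structure the abstract describes: a KR dual term that estimates the Wasserstein distance between the two class-conditional distributions, plus a hinge term that enforces a classification margin. The function name `hinge_kr_loss`, the margin `m`, and the weight `lam` are illustrative assumptions, not the paper's exact formulation or recommended values; `f` is assumed to be a 1-Lipschitz network.

```python
import torch

def hinge_kr_loss(f_x, y, margin=1.0, lam=10.0):
    """Sketch of a hinge-regularized Kantorovich-Rubinstein loss.

    f_x : (N,) outputs of a 1-Lipschitz network f
    y   : (N,) labels in {-1, +1}; the batch should contain both classes.

    The KR term is the (negated) dual objective
    E_{P-}[f] - E_{P+}[f], whose maximization over 1-Lipschitz f
    estimates W1(P+, P-); the hinge term pushes correctly classified
    points beyond a margin so the minimizer is also a classifier.
    """
    pos, neg = f_x[y > 0], f_x[y < 0]
    kr = neg.mean() - pos.mean()              # negated KR dual objective
    hinge = torch.relu(margin - y * f_x).mean()  # margin violations
    return kr + lam * hinge
```

With a 1-Lipschitz `f`, the prediction sign(f(x)) then comes with an immediate certificate: no input perturbation of norm smaller than |f(x)| can flip the sign, which is the "direct interpretation of the loss in terms of adversarial robustness" the abstract refers to.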

Cited by 4 publications (6 citation statements) · References 14 publications
“…Miyato et al. (2018) achieved 1-Lipschitz fully connected layers by bounding the spectral norm of the weight matrices to be 1. Similarly, Serrurier et al. (2021) considered neural networks f in which each component f_i is 1-Lipschitz; thus, differently from the 1-Lipschitz networks mentioned before, given a sample x, the lower bound of MAP is deduced as 1/2 (f_l(x) − f_s(x)). Other authors leveraged orthogonal weight matrices to pursue the same objective.…”
Section: Related Work
confidence: 99%
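A hedged sketch of the two ingredients this statement refers to: bounding a layer's spectral norm by power iteration (the mechanism behind spectral normalization), and the margin-based robustness bound. If every component f_i is 1-Lipschitz, the difference f_l − f_s is 2-Lipschitz, which is why half the gap between the two largest outputs lower-bounds the distance to the decision boundary. The helper names below are hypothetical, and the code is illustrative rather than the cited papers' implementations.

```python
import torch

def spectral_norm_upper_bound(W, n_iter=50):
    """Largest singular value of W via power iteration, as used to
    constrain a layer to be 1-Lipschitz by rescaling W / sigma."""
    u = torch.randn(W.shape[0])
    for _ in range(n_iter):
        v = W.T @ u; v = v / v.norm()
        u = W @ v;  u = u / u.norm()
    return u @ W @ v  # approximates sigma_max(W)

def certified_radius(logits):
    """Margin-based lower bound on the distance to the decision
    boundary when each network component is 1-Lipschitz: half the
    gap between the largest and second-largest output."""
    top2 = torch.topk(logits, 2).values
    return 0.5 * (top2[0] - top2[1])
```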
“…network output (Tsuzuku, Sato, and Sugiyama 2018). These particular models can be obtained by composing orthogonal layers (Cisse et al. 2017; Li et al. 2019; Trockman and Kolter 2021; Serrurier et al. 2021) and norm-preserving activation functions, such as those presented by Anil, Lucas, and Grosse (2019) and Chernodub and Nowicki (2017). However, despite satisfying the Lipschitz inequality, these models do not provide the exact boundary distance but only a lower bound.…”
Section: Introduction
confidence: 99%
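For concreteness, here is a toy sketch of the two building blocks named above: an orthogonal linear layer and the norm-preserving MaxMin/GroupSort activation. The QR-based parametrization is an assumption chosen for brevity; the cited works use more refined constructions (e.g. Cayley transforms or Björck orthonormalization).

```python
import torch

def maxmin(x):
    """MaxMin / GroupSort-2 activation (Anil, Lucas, and Grosse 2019):
    sorts each consecutive pair of units (last dim must be even).
    Per sample it only permutes coordinate values, so it is
    norm-preserving and 1-Lipschitz."""
    a, b = x[..., 0::2], x[..., 1::2]
    return torch.cat([torch.maximum(a, b), torch.minimum(a, b)], dim=-1)

class OrthogonalLinear(torch.nn.Module):
    """Toy orthogonal layer: the weight is the Q factor of a QR
    decomposition of a free parameter, so ||Wx|| = ||x|| exactly."""
    def __init__(self, dim):
        super().__init__()
        self.param = torch.nn.Parameter(torch.randn(dim, dim))

    def forward(self, x):
        q, _ = torch.linalg.qr(self.param)  # q is square orthogonal
        return x @ q.T
```

Composing such layers with `maxmin` yields a network that is 1-Lipschitz by construction, which is what makes the lower bound on the boundary distance valid.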
“…For an introduction to this theory, we refer to [16]. Most applications of OT are related to the very active field of machine learning, notably in the framework of generative networks [17], robustness [18], or fairness [19], among others. With some notable exceptions [5, 20-23], the Wasserstein distance has not been widely used in structural biology.…”
Section: Distances Between Local and Global Structural Descriptors
confidence: 99%
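As a small illustration of the distance these applications rely on: for one-dimensional empirical measures, the Wasserstein-1 distance reduces to the L1 distance between quantile functions and can be computed directly from sorted samples. The data below is synthetic and purely illustrative.

```python
import numpy as np
from scipy.stats import wasserstein_distance

# Two hypothetical 1-D descriptor distributions (e.g. pairwise-distance
# histograms of two structures); synthetic data for illustration only.
rng = np.random.default_rng(0)
a = rng.normal(0.0, 1.0, size=1000)
b = rng.normal(0.5, 1.2, size=1000)

# scipy computes W1 from the sorted samples / empirical CDFs.
print(wasserstein_distance(a, b))
```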
“…A fine characterization of the convergence conditions of recurrent neural networks and of their stability via the estimation of a Lipschitz constant is given in [37, 38]. In particular, the Lipschitz constant estimated in [38] is more accurate than in basic approaches, which often rely on computing the product of the norms of the linear weight operators of each layer, as in [39, 40]. Thanks to the aforementioned works, proofs of convergence and stability have been demonstrated for specific neural networks applied to inverse problems, as in [24, 25, 34].…”
Section: Variational Problem
confidence: 99%
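The "basic approach" this statement contrasts against can be sketched in a few lines: upper-bound a feed-forward network's Lipschitz constant by the product of the spectral norms of its linear layers, assuming 1-Lipschitz activations. The bound is valid (Lipschitz constants compose multiplicatively) but typically very loose, which is what motivates the tighter estimates of [38]. The helper name below is hypothetical.

```python
import torch

def naive_lipschitz_bound(model):
    """Loose upper bound on the Lipschitz constant of a feed-forward
    network with 1-Lipschitz activations: the product of the spectral
    norms of its linear layers."""
    bound = 1.0
    for m in model.modules():
        if isinstance(m, torch.nn.Linear):
            bound *= torch.linalg.matrix_norm(m.weight, ord=2).item()
    return bound
```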