Antoine Ledent scite author profile

Antoine Ledent

5Publications

15Citation Statements Received

47Citation Statements Given

How they've been cited

How they cite others

Affiliations

Singapore Management University, University of Kaiserslautern

Publications

Order By: Most citations

Norm-based generalisation bounds for multi-class convolutional neural networks

Ledent¹,

Mustafa²,

Lei³

et al. 2019

Preprint

View full text Add to dashboard Cite

We show generalisation error bounds for deep learning with two main improvements over the state of the art. (1) Our bounds have no explicit dependence on the number of classes except for logarithmic factors. This holds even when formulating the bounds in terms of the L 2 -norm of the weight matrices, where previous bounds exhibit at least a square-root dependence on the number of classes. (2) We adapt the classic Rademacher analysis of DNNs to incorporate weight sharing-a task of fundamental theoretical importance which was previously attempted only under very restrictive assumptions. In our results, each convolutional filter contributes only once to the bound, regardless of how many times it is applied. Further improvements exploiting pooling and sparse connections are provided. The presented bounds scale as the norms of the parameter matrices, rather than the number of parameters. In particular, contrary to bounds based on parameter counting, they are asymptotically tight (up to log factors) when the weights approach initialisation, making them suitable as a basic ingredient in bounds sensitive to the optimisation procedure. We also show how to adapt the recent technique of loss function augmentation to our situation to replace spectral norms by empirical analogues whilst maintaining the advantages of our approach.

show abstract

Burst-induced Multi-Armed Bandit for Learning Recommendation

Alves

Ledent

Kloft

2021

View full text Add to dashboard Cite

Orthogonal Inductive Matrix Completion

Ledent

Alves

Kloft

2023

IEEE Trans. Neural Netw. Learning Syst.

View full text Add to dashboard Cite

Learning Interpretable Concept Groups in CNNs

Varshneya

Ledent

Vandermeulen

et al. 2021

View full text Add to dashboard Cite

We propose a novel training methodology---Concept Group Learning (CGL)---that encourages training of interpretable CNN filters by partitioning filters in each layer into \emph{concept groups}, each of which is trained to learn a single visual concept. We achieve this through a novel regularization strategy that forces filters in the same group to be active in similar image regions for a given layer. We additionally use a regularizer to encourage a sparse weighting of the concept groups in each layer so that a few concept groups can have greater importance than others. We quantitatively evaluate CGL's model interpretability using standard interpretability evaluation techniques and find that our method increases interpretability scores in most cases. Qualitatively we compare the image regions which are most active under filters learned using CGL versus filters learned without CGL and find that CGL activation regions more strongly concentrate around semantically relevant features.

show abstract

Fine-grained Generalization Analysis of Structured Output Prediction

Mustafa

Lei

Ledent

et al. 2021

View full text Add to dashboard Cite

In machine learning we often encounter structured output prediction problems (SOPPs), i.e. problems where the output space admits a rich internal structure. Application domains where SOPPs naturally occur include natural language processing, speech recognition, and computer vision. Typical SOPPs have an extremely large label set, which grows exponentially as a function of the size of the output. Existing generalization analysis implies generalization bounds with at least a square-root dependency on the cardinality d of the label set, which can be vacuous in practice. In this paper, we significantly improve the state of the art by developing novel high-probability bounds with a logarithmic dependency on d. Furthermore, we leverage the lens of algorithmic stability to develop generalization bounds in expectation without any dependency on d. Our results therefore build a solid theoretical foundation for learning in large-scale SOPPs. Furthermore, we extend our results to learning with weakly dependent data.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Antoine Ledent

Norm-based generalisation bounds for multi-class convolutional neural networks

Burst-induced Multi-Armed Bandit for Learning Recommendation

Orthogonal Inductive Matrix Completion

Learning Interpretable Concept Groups in CNNs

Fine-grained Generalization Analysis of Structured Output Prediction

Contact Info

Product

Resources

About