2021
DOI: 10.1017/s0962492921000027

Deep learning: a statistical viewpoint

Abstract: The remarkable practical success of deep learning has revealed some major surprises from a theoretical perspective. In particular, simple gradient methods easily find near-optimal solutions to non-convex optimization problems, and despite giving a near-perfect fit to training data without any explicit effort to control model complexity, these methods exhibit excellent predictive accuracy. We conjecture that specific principles underlie these phenomena: that overparametrization allows gradient methods to find interpolating solutions, that these methods implicitly impose regularization, and that overparametrization leads to benign overfitting, that is, accurate predictions despite overfitting training data.
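The phenomenon the abstract describes, an exact fit to noisy training data that nevertheless predicts well (benign overfitting), can be reproduced in the simplest setting the survey analyses: minimum-norm interpolation in overparametrized linear regression. The sketch below is illustrative only; the dimensions, the spiked covariance spectrum and the noise level are assumptions chosen to create a benign spectrum, not values taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Overparametrized regression: many more features (d) than samples (n).
n, d, k, sigma = 200, 2000, 10, 0.5

# Covariance with a few strong "signal" directions and a long, flat tail of
# weak directions -- the kind of spectrum under which the minimum-norm
# interpolator overfits benignly.
eigs = np.concatenate([np.ones(k), 0.01 * np.ones(d - k)])
w_true = np.concatenate([np.ones(k), np.zeros(d - k)])

def sample(m):
    X = rng.normal(size=(m, d)) * np.sqrt(eigs)       # features with covariance diag(eigs)
    y = X @ w_true + sigma * rng.normal(size=m)       # noisy labels
    return X, y

X, y = sample(n)

# Minimum-norm interpolating solution: what gradient descent started at zero
# converges to for this under-determined least-squares problem.
w_hat = np.linalg.pinv(X) @ y

X_test, y_test = sample(5000)
print("train MSE:", np.mean((X @ w_hat - y) ** 2))            # ~ 0: the noise is fit exactly
print("test  MSE:", np.mean((X_test @ w_hat - y_test) ** 2))  # stays near sigma^2, far below
                                                              # the ~10 error of predicting zero
```

Fitting the training noise exactly does not hurt here because the excess capacity lies in the many low-variance directions, where the absorbed noise contributes little to prediction error.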

Cited by 97 publications (62 citation statements). References 85 publications (78 reference statements).
“…Despite the tile-level false positives, the quantitative measures have shown excellent predictive accuracy, robust performance and generalization on both cohorts. This may be associated with the interpolation and generalization capabilities of overparameterized deep learning systems leading to benign overfitting, as demonstrated by Bartlett et al. in their latest findings (23).…”
Section: Discussion
confidence: 91%
“…The activation of a single neuron produces one piece of the data analysis, and many connected neurons form a complete NN model that outputs the analysis of the full data. NN technology first emerged in the 1950s and 1960s; the simplest NN is the perceptron, which is essentially a feedforward NN structure and is also relatively common [9]. The perceptron consists of an input layer, an output layer, and a hidden layer.…”
Section: DL and Security Evaluation of Enterprises
confidence: 99%
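The perceptron-style feedforward structure described in that excerpt (an input layer feeding a hidden layer feeding an output layer) can be written down in a few lines. This is a generic sketch, not code from the cited paper; the layer sizes and the ReLU activation are arbitrary illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(0)

class FeedforwardNet:
    """A minimal one-hidden-layer feedforward network: input -> hidden -> output."""

    def __init__(self, n_in, n_hidden, n_out):
        self.W1 = rng.normal(scale=1.0 / np.sqrt(n_in), size=(n_in, n_hidden))
        self.b1 = np.zeros(n_hidden)
        self.W2 = rng.normal(scale=1.0 / np.sqrt(n_hidden), size=(n_hidden, n_out))
        self.b2 = np.zeros(n_out)

    def forward(self, x):
        # Each hidden neuron activates on a weighted sum of the inputs...
        h = np.maximum(0.0, x @ self.W1 + self.b1)   # ReLU activation
        # ...and the output layer combines the hidden activations into the result.
        return h @ self.W2 + self.b2

net = FeedforwardNet(n_in=4, n_hidden=16, n_out=1)
print(net.forward(rng.normal(size=(3, 4))).shape)    # (3, 1): one output per input row
```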
“…Several recent works have investigated the nature of modern Deep Neural Networks (DNNs) past the point of zero training error (Belkin, 2021; Nakkiran et al., 2020; Bartlett et al., 2021; Power et al., 2022). The stage at which the training error reaches zero is called the Interpolation Threshold (IT), since at this point the learned network function interpolates between training samples.…”
Section: Introduction
confidence: 99%
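To make the Interpolation Threshold concrete: it is the first point in training at which the empirical error reaches zero, so that the learned function passes through every training sample, and training beyond that point is "post-IT". A minimal sketch of locating it on a recorded training curve; the error counts below are made-up illustrative values.

```python
def interpolation_threshold(train_errors):
    """Return the first epoch at which the training error reaches zero
    (the model interpolates all training samples), or None if it never does."""
    for epoch, err in enumerate(train_errors):
        if err == 0:
            return epoch
    return None

# Hypothetical per-epoch counts of misclassified training samples for an
# overparametrized model that eventually fits the training set exactly.
curve = [412, 180, 55, 12, 3, 0, 0, 0]
print(interpolation_threshold(curve))   # 5 -- epochs after this are past the IT
```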