2020
DOI: 10.1073/pnas.2005013117

Overparameterized neural networks implement associative memory

Abstract: Identifying computational mechanisms for memorization and retrieval of data is a long-standing problem at the intersection of machine learning and neuroscience. Our main finding is that standard overparameterized deep neural networks trained using standard optimization methods implement such a mechanism for real-valued data. We provide empirical evidence that 1) overparameterized autoencoders store training samples as attractors and thus iterating the learned map leads to sample recovery, and that 2) the same …
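The retrieval mechanism in 1) can be illustrated concretely. The sketch below is not the paper's code; it assumes a small PyTorch MLP autoencoder on toy data, with illustrative sizes and step counts: train to interpolation, then iterate the learned map from a corrupted query and check whether it falls back onto the stored sample.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Toy "training set": a handful of real-valued samples to be memorized.
X = torch.randn(5, 32)

# Overparameterized autoencoder: many more parameters than training samples.
model = nn.Sequential(
    nn.Linear(32, 512), nn.ReLU(),
    nn.Linear(512, 512), nn.ReLU(),
    nn.Linear(512, 32),
)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)

# Train to (near-)zero reconstruction error, i.e. interpolate the data.
for _ in range(5000):
    loss = ((model(X) - X) ** 2).mean()
    opt.zero_grad()
    loss.backward()
    opt.step()

# Retrieval: if a training sample is an attractor of the learned map,
# iterating the map from a nearby corrupted query converges back to it.
with torch.no_grad():
    z = X[0] + 0.3 * torch.randn(32)   # noisy version of sample 0
    for _ in range(100):
        z = model(z)                   # iterate the learned map
    print("distance to stored sample:", (z - X[0]).norm().item())
```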

Cited by 34 publications (42 citation statements)
References 22 publications (26 reference statements)
“…While autoencoders and other generative models have been used for computing signatures of perturbations in other works as well 39,40, these works have used autoencoders in the standard way, to obtain a lower-dimensional embedding of the data. Motivated by our recent work, which, quite counter-intuitively, described various benefits of using autoencoders to learn a latent representation of the data that is higher-dimensional than the original space 41, we found that overparameterized autoencoders not only led to the better … [quote interrupted by figure residue; recoverable caption: Fig. 3, Mining FDA-approved drugs by correlating disease and drug signatures using an overparameterized autoencoder embedding; panel labels: Protein coding genes, SARS-CoV-2, L1000]”
Section: Results
mentioning (confidence: 99%)
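For readers unfamiliar with this inverted use of the bottleneck, a minimal sketch of what an "overparameterized autoencoder embedding" means here, assuming PyTorch; the dimensions are illustrative (978 is the number of L1000 landmark genes, 2048 an arbitrary wider latent), and the architecture is not the cited framework's:

```python
import torch
import torch.nn as nn

# The usual autoencoder bottleneck is inverted: the latent space is made
# HIGHER-dimensional than the input, rather than lower.
in_dim, latent_dim = 978, 2048

encoder = nn.Sequential(nn.Linear(in_dim, latent_dim), nn.ReLU())
decoder = nn.Linear(latent_dim, in_dim)

x = torch.randn(16, in_dim)        # toy batch of expression profiles
z = encoder(x)                     # overparameterized embedding
recon = decoder(z)
loss = ((recon - x) ** 2).mean()   # trained as usual, despite no bottleneck
```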
“…First, in order to ensure that the CMap database, which measures expression using 1000 representative genes, would be useful in the context of SARS-CoV-2, we validated that the intersection of these genes with the SARS-CoV-2 differentially expressed genes was significant. Second, to establish drug signatures based on the CMap database, we employed a particular autoencoder framework 41. Rather unintuitively, we showed that using an overparameterized autoencoder, i.e., using an autoencoder not to perform dimension reduction as usual but instead to embed the data into a higher-dimensional space, aligned the drug signatures across different cell types.…”
Section: Discussion
mentioning (confidence: 99%)
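The signature-correlation step can be sketched as follows. This is a hypothetical NumPy illustration, not the cited pipeline: the `disease_sig` and `drug_sigs` data are random placeholders, and ranking by Pearson anti-correlation is an assumed matching rule.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical data: a disease signature and a library of drug signatures,
# all expressed in the same embedding space. A drug whose signature is
# strongly ANTI-correlated with the disease signature is a candidate for
# reversing the disease expression profile.
disease_sig = rng.standard_normal(978)
drug_sigs = {f"drug_{i}": rng.standard_normal(978) for i in range(100)}

def pearson(a, b):
    return float(np.corrcoef(a, b)[0, 1])

# Rank drugs from most anti-correlated to most correlated.
ranked = sorted(drug_sigs, key=lambda name: pearson(disease_sig, drug_sigs[name]))
print("top repurposing candidates:", ranked[:5])
```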
“…For example, we used the same code to analyze the six influenza studies (with ranks between 6 and 9) and the HIV-1 CATNAP dataset containing hundreds of studies (with rank 23). Further refinements to our approach could incorporate side information such as virus sequence (Radhakrishnan et al., 2021) or use tensor factorization to decompose higher-order data (Liu & Moitra, 2020).…”
Section: Discussion
mentioning (confidence: 99%)
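The tensor-factorization refinement mentioned above might look like the following rank-R CP decomposition, sketched here with gradient descent in PyTorch; the shapes, rank, and optimizer are illustrative assumptions, not the method of Liu & Moitra (2020).

```python
import torch

torch.manual_seed(0)

# Toy 3-way tensor, e.g. antibody x virus x study neutralization data.
T = torch.randn(30, 40, 50)
R = 8                          # chosen CP rank

# Factor matrices of a rank-R CP decomposition, fit by gradient descent.
A = torch.randn(30, R, requires_grad=True)
B = torch.randn(40, R, requires_grad=True)
C = torch.randn(50, R, requires_grad=True)
opt = torch.optim.Adam([A, B, C], lr=1e-2)

for _ in range(2000):
    # Reconstruction: T_hat[i, j, k] = sum_r A[i, r] * B[j, r] * C[k, r]
    T_hat = torch.einsum('ir,jr,kr->ijk', A, B, C)
    loss = ((T_hat - T) ** 2).mean()
    opt.zero_grad()
    loss.backward()
    opt.step()
```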
“…One line of enquiry is interpreting depth recursively. Indeed, in certain settings increasing the depth manifests similarly to iterating a map given by a shallow network (Radhakrishnan, Belkin and Uhler 2020). Furthermore, fixed points of such iterations have been proposed as an alternative to deep networks, with some success (Bai, Kolter and Koltun 2019).…”
Section: Are Deep Neural Networks Kernel Machines?
mentioning (confidence: 99%)
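The "depth as iteration of a shallow map" view, and its fixed-point variant, can be sketched as follows. This is an illustrative weight-tied iteration in PyTorch rather than the full deep-equilibrium method of Bai et al. (2019); the weight rescaling used to encourage a contractive map is an assumption.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# One shallow, weight-tied map applied repeatedly, instead of a stack of
# distinct layers. If the iteration converges, the output is a fixed
# point z* satisfying z* = f(z* + x).
f = nn.Sequential(nn.Linear(64, 64), nn.Tanh())
with torch.no_grad():
    f[0].weight.mul_(0.5)   # shrink weights so the map tends to contract

def fixed_point(x, n_iter=100):
    z = torch.zeros_like(x)
    for _ in range(n_iter):
        z = f(z + x)        # iterate the same shallow map
    return z

x = torch.randn(8, 64)
z_star = fixed_point(x)
# Residual should be small if the iteration has (approximately) converged.
print("fixed-point residual:", (f(z_star + x) - z_star).norm().item())
```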