2018
DOI: 10.3389/fnins.2018.00608
Deep Supervised Learning Using Local Errors

Abstract: Error backpropagation is a highly effective mechanism for learning high-quality hierarchical features in deep networks. Updating the features or weights in one layer, however, requires waiting for the propagation of error signals from higher layers. Learning using delayed and non-local errors makes it hard to reconcile backpropagation with the learning mechanisms observed in biological neural networks, as it requires the neurons to maintain a memory of the input long enough until the higher-layer errors arrive.…
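The abstract describes generating error signals locally at each layer instead of waiting for errors backpropagated from higher layers. As a rough illustration of that idea, the sketch below trains a small fully connected network in PyTorch, giving each hidden layer a fixed random readout that produces a local loss, with detach() blocking gradient flow between layers. The layer sizes, loss function, and optimizer settings are illustrative assumptions, not the paper's exact configuration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class LocalErrorNet(nn.Module):
    """Sketch of layer-local error learning (assumed shapes, not the paper's setup)."""

    def __init__(self, sizes=(784, 500, 500, 10)):
        super().__init__()
        self.layers = nn.ModuleList(
            [nn.Linear(a, b) for a, b in zip(sizes[:-1], sizes[1:])]
        )
        # Fixed random readouts supply each hidden layer's local error signal.
        self.readouts = nn.ModuleList(
            [nn.Linear(b, sizes[-1], bias=False) for b in sizes[1:-1]]
        )
        for r in self.readouts:
            r.weight.requires_grad_(False)   # random and never trained

    def forward(self, x, y):
        local_losses = []
        h = x
        for layer, readout in zip(self.layers[:-1], self.readouts):
            h = F.relu(layer(h))
            # Local loss computed from this layer's activity and the labels only.
            local_losses.append(F.cross_entropy(readout(h), y))
            h = h.detach()                   # block gradients to earlier layers
        logits = self.layers[-1](h)
        local_losses.append(F.cross_entropy(logits, y))
        return logits, sum(local_losses)

# Usage: a single optimizer step updates every layer from its own local error.
model = LocalErrorNet()
opt = torch.optim.SGD([p for p in model.parameters() if p.requires_grad], lr=0.1)
x, y = torch.randn(32, 784), torch.randint(0, 10, (32,))
_, loss = model(x, y)
opt.zero_grad()
loss.backward()
opt.step()
```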

Cited by 89 publications (92 citation statements)
References: 60 publications
“…Backpropagation is biologically unrealistic for several reasons, such as the need to interleave forward and backward passes and the use of symmetric weights in the forward and backward passes. More biologically plausible models have been proposed to address these issues, either by using contrastive learning in energy-based models (Xie & Seung, 2003; Bengio & Fischer, 2015; Scellier & Bengio, 2017), or by relaxing the symmetry requirement through random weights in the backward pass (Lillicrap et al., 2016; Baldi et al., 2016; Nøkland, 2016; Mostafa et al., 2017). These methods, however, have been applied in supervised learning settings, and their performance and applicability to learning in stochastic networks is unclear.…”
Section: Discussion
confidence: 99%
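The statement above mentions relaxing weight symmetry by using random weights in the backward pass (feedback alignment, Lillicrap et al., 2016). Below is a minimal NumPy sketch of that idea for a one-hidden-layer network; the architecture, the squared-error loss, and the learning rate are assumptions chosen for illustration only.

```python
import numpy as np

rng = np.random.default_rng(0)
n_in, n_hid, n_out, lr = 20, 50, 5, 0.05

W1 = rng.normal(0, 0.1, (n_hid, n_in))
W2 = rng.normal(0, 0.1, (n_out, n_hid))
B2 = rng.normal(0, 0.1, (n_hid, n_out))    # fixed random feedback weights

def step(x, y):
    """One feedback-alignment update on a single example."""
    global W1, W2
    h = np.tanh(W1 @ x)                    # forward pass
    y_hat = W2 @ h
    e = y_hat - y                          # output error (squared-error gradient)
    # The hidden-layer error uses B2, not W2.T as exact backpropagation would.
    delta_h = (B2 @ e) * (1 - h ** 2)
    W2 -= lr * np.outer(e, h)
    W1 -= lr * np.outer(delta_h, x)
    return 0.5 * float(e @ e)

x, y = rng.normal(size=n_in), rng.normal(size=n_out)
for _ in range(100):
    loss = step(x, y)
print(f"final loss on the toy example: {loss:.4f}")
```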
“…Further work is needed to develop a more biologically motivated learning method, in the spirit of the learning method in Mostafa et al. (2017), that learns online and changes synaptic weights based only on information in the pre- and post-synaptic neurons.…”
Section: Introduction
confidence: 99%
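The statement above refers to weight updates that rely only on pre- and post-synaptic information. As a hedged illustration of such a purely local rule (Oja's rule here, not the cited paper's method), each weight change below depends only on the pre-synaptic activity, the post-synaptic activity, and the current weight; dimensions and learning rate are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
n_pre, n_post, lr = 100, 10, 0.01
W = rng.normal(0, 0.1, (n_post, n_pre))

def local_update(W, x, lr=lr):
    """One online Oja update; every term is local to the synapse's endpoints."""
    y = W @ x                                      # post-synaptic activity
    # dW_ij depends only on y_i (post), x_j (pre), and W_ij itself.
    return W + lr * (np.outer(y, x) - (y ** 2)[:, None] * W)

for _ in range(1000):
    W = local_update(W, rng.normal(size=n_pre))
print("weight row norms (Oja's rule keeps them bounded):",
      np.linalg.norm(W, axis=1).round(2))
```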
“…Consistent with our results, high classification performance was reported using a GoogLeNet model pre-trained on ImageNet as a feature extractor (Zhu et al., 2019). Deep neural networks (DNNs) are trained using the SGD algorithm, which estimates an expected error gradient for the current model state from the training data and corrects the node weights by backpropagation; the amount by which the weights are updated during training is a configurable hyperparameter called the learning rate (LR) (Mostafa et al., 2018; Zhao et al., 2019). The performance of SGD depends on how the LR, which controls the size of the weight update at the end of each batch of training, is tuned over time (Zhao et al., 2019).…”
Section: Contributions Of Parameters For Prediction Performance In Th…
confidence: 94%
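The statement above describes minibatch SGD with backpropagated gradients and a learning rate (LR) tuned over time. A minimal PyTorch sketch of such a training loop with a step-decay LR schedule is shown below; the model, the schedule, and all hyperparameters are illustrative assumptions, not the cited papers' settings.

```python
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(784, 128), nn.ReLU(), nn.Linear(128, 10))
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.1, momentum=0.9)
# Step decay: multiply the LR by 0.1 every 10 epochs.
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=10, gamma=0.1)

for epoch in range(30):
    for _ in range(100):                      # stand-in for a data loader
        x = torch.randn(32, 784)
        y = torch.randint(0, 10, (32,))
        optimizer.zero_grad()
        loss = criterion(model(x), y)         # error on the current batch
        loss.backward()                       # backpropagate the gradient
        optimizer.step()                      # weight update scaled by the LR
    scheduler.step()                          # adjust the LR over time
```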
“…Gradient BP can solve this, but it is not compatible with a physical implementation of the neural network [6]. Several approximations have emerged recently to solve this, such as feedback alignment [7]–[9] and local losses defined for each layer [10]–[12]. For classification, local losses can be local classifiers (using output labels) [10] or supervised clustering, which perform on par with and sometimes better than BP on classical ML benchmark tasks [12].…”
Section: B. Local Losses and Local Errors
confidence: 99%
“…Several approximations have emerged recently to solve this, such as feedback alignment [7]–[9] and local losses defined for each layer [10]–[12]. For classification, local losses can be local classifiers (using output labels) [10] or supervised clustering, which perform on par with and sometimes better than BP on classical ML benchmark tasks [12]. For all experiments in this work, we use a layerwise local classifier with a mean-squared error loss defined as…”
Section: B. Local Losses and Local Errors
confidence: 99%
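The quoted statement breaks off before the loss definition. One plausible form of a layerwise local-classifier mean-squared error loss, stated purely as an assumption and not as the citing paper's exact equation, is:

```latex
% Assumed form of the layer-l local-classifier MSE loss (not the citing paper's exact definition):
% h^{(l)}_n is layer l's activation for sample n, W^{(l)}_c that layer's local classifier,
% y_n the one-hot target, and N the batch size.
\mathcal{L}^{(l)} = \frac{1}{N} \sum_{n=1}^{N}
    \left\lVert W^{(l)}_{c}\, h^{(l)}_{n} - y_{n} \right\rVert_{2}^{2}
```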