2019
DOI: 10.48550/arxiv.1906.09529
Preprint
Learning Activation Functions: A new paradigm for understanding Neural Networks

Abstract: The scope of research in the domain of activation functions remains limited and centered around improving the ease of optimization or the generalization quality of neural networks (NNs). However, to develop a deeper understanding of deep learning, it becomes important to look at the non-linear component of NNs more carefully. In this paper, we aim to provide a generic form of activation function along with appropriate mathematical grounding so as to allow for insights into the working of NNs in the future. We propose …

Cited by 6 publications (9 citation statements)
References 4 publications
“…Convolutional filters are applied to the input through the convolutional layers of a CNN to compute the outputs of neurons connected to specific regions of the input. This helps extract temporal and spatial features from an image (Goyal et al, 2019).…”
Section: Convolutional Neural Network
confidence: 99%
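To make the quoted description concrete, here is a minimal NumPy sketch of a single convolutional filter sliding over local regions of an input, so that each output value depends only on the region it is connected to; the 28×28 image and the vertical-edge kernel are illustrative assumptions, not details from the cited work.

    import numpy as np

    def conv2d(image, kernel):
        # Valid 2D cross-correlation: each output neuron sees one local region of the input.
        kh, kw = kernel.shape
        oh, ow = image.shape[0] - kh + 1, image.shape[1] - kw + 1
        out = np.zeros((oh, ow))
        for i in range(oh):
            for j in range(ow):
                region = image[i:i + kh, j:j + kw]    # local receptive field
                out[i, j] = np.sum(region * kernel)   # weighted sum over that region
        return out

    image = np.random.rand(28, 28)                            # toy grayscale image (assumed size)
    kernel = np.array([[1, 0, -1], [1, 0, -1], [1, 0, -1]])   # illustrative vertical-edge filter
    feature_map = conv2d(image, kernel)                       # 26 x 26 spatial feature map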
“…Max and average are the two popular pooling methods. A fully connected layer with 512 units is used to classify the image into different classes (Goyal et al, 2019; Kang et al, 2021). For feature-map normalization, a batch normalization layer is used.…”
Section: Convolutional Neural Network
confidence: 99%
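The layer ordering in the quoted passage (convolutional filters, batch normalization of the feature maps, max pooling, and a 512-unit fully connected classifier) can be sketched as a small PyTorch module; the input resolution, channel counts, and number of classes below are assumptions for illustration only.

    import torch
    import torch.nn as nn

    class SmallCNN(nn.Module):
        def __init__(self, num_classes=10):
            super().__init__()
            self.features = nn.Sequential(
                nn.Conv2d(3, 32, kernel_size=3, padding=1),  # convolutional filters
                nn.BatchNorm2d(32),                          # feature-map normalization
                nn.ReLU(),
                nn.MaxPool2d(2),                             # max pooling
            )
            self.classifier = nn.Sequential(
                nn.Flatten(),
                nn.Linear(32 * 16 * 16, 512),                # 512-unit fully connected layer
                nn.ReLU(),
                nn.Linear(512, num_classes),                 # class scores
            )

        def forward(self, x):
            return self.classifier(self.features(x))

    logits = SmallCNN()(torch.randn(1, 3, 32, 32))           # e.g. one 32x32 RGB image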
“…ω for all l, φ_1(x) = max{x, 0}, and φ_2(x) = (e^x − 1) · I_{x≤0}(x), the Kronecker network becomes a FF network with Exponential Linear Unit (ELU) activation [7] if ω_1^l = 1 for all l, and becomes a FF network with Scaled Exponential Linear Unit (SELU) activation [24] if ω_1^l = ω for all l. • If K = 1, the Kronecker network becomes a feed-forward neural network with layer-wise locally adaptive activation functions [20,21]. • If ω^l = 1 for all l and φ_k(x) = x^(k−1) for all k, the Kronecker network becomes a feed-forward neural network with self-learnable activation functions (SLAF) [11]. Similarly, a FFN with a smooth adaptive activation function [17] can be represented by a Kronecker network.…”
Section: Mathematical Setup and Kronecker Neural Network
confidence: 99%
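The SLAF case quoted above (φ_k(x) = x^(k−1) with learnable mixing coefficients) can be written as a short module in which the activation itself is a polynomial whose coefficients are trained by backpropagation; this is a minimal sketch, and the degree K, the initialization, and the class name SLAF are assumptions rather than details from [11].

    import torch
    import torch.nn as nn

    class SLAF(nn.Module):
        def __init__(self, K=3):
            super().__init__()
            self.coeffs = nn.Parameter(torch.zeros(K))  # a_1, ..., a_K, learned with the network weights
            self.coeffs.data[1] = 1.0                   # start near the identity map (assumed init)

        def forward(self, x):
            # f(x) = sum_k a_k * x^(k-1): a polynomial activation whose shape adapts during training
            return sum(a * x ** k for k, a in enumerate(self.coeffs))

    act = SLAF(K=3)
    y = act(torch.linspace(-2.0, 2.0, 5))  # applied elementwise, like any fixed activation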
“…However, there is no rule of thumb for choosing an optimal activation function. This has motivated the use of adaptive activation functions by our group and others, see [1,17,20,21,53,11,45], with varying results demonstrating superior performance over non-adaptive, fixed activation functions in various learning tasks.…”
confidence: 99%
“…While smooth activation functions such as sigmoid, logistic, or hyperbolic tangent are widely used in machine learning, they suffer from the "vanishing gradient problem" [6] because their derivatives approach zero for large inputs. Neural networks based on polynomial activation functions are an alternative [10,12,20,21,37,57], but can be numerically unstable due to large gradients for large inputs [6]. Moreover, polynomials do not approximate non-smooth functions efficiently [56], which can cause optimization issues in classification problems.…”
Section: Introduction
confidence: 99%
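Both failure modes in the quoted passage are easy to see numerically; the sketch below uses a few assumed example inputs to show the sigmoid derivative vanishing for large inputs while the derivative of a cubic activation grows without bound.

    import numpy as np

    x = np.array([0.0, 5.0, 20.0])            # assumed example pre-activations

    sigmoid = 1.0 / (1.0 + np.exp(-x))
    sigmoid_grad = sigmoid * (1.0 - sigmoid)  # approaches 0 for large x: vanishing gradients
    cubic_grad = 3.0 * x ** 2                 # derivative of x^3: grows as x grows

    print(sigmoid_grad)  # [2.5e-01, ~6.6e-03, ~2.1e-09]
    print(cubic_grad)    # [0., 75., 1200.]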