2020 International Symposium on Electronics and Telecommunications (ISETC) 2020
DOI: 10.1109/isetc50328.2020.9301059
|View full text |Cite
|
Sign up to set email alerts
|

P-Swish: Activation Function with Learnable Parameters Based on Swish Activation Function in Deep Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 27 publications
(10 citation statements)
references
References 2 publications
0
10
0
Order By: Relevance
“…Many studies have tried to improve the performance of CNNs models from network architecture [22]- [25], different variants of optimization [26]- [28], activations [29]- [32], regularization methods [33], [34] and so no. However, little attention has been paid to investigating the padding schemes during the convolution operation.…”
Section: Literature Review and Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Many studies have tried to improve the performance of CNNs models from network architecture [22]- [25], different variants of optimization [26]- [28], activations [29]- [32], regularization methods [33], [34] and so no. However, little attention has been paid to investigating the padding schemes during the convolution operation.…”
Section: Literature Review and Related Workmentioning
confidence: 99%
“…The CIFAR-10 dataset includes a training dataset of 50,000 images and a test dataset of 10,000 images. The images are of shape (32,32,3), distributed equally to ten classes of airplane, automobile, bird, cat, deer, dog, frog, horse, ship, and truck. The Padding Module was applied to different networks namely: VGG16 [8] and ResNet50V2 [39]; to make the deeper layers in these networks carry out a valid convolution, the images were resized to (64, 64, 3) and (224, 224, 3) for the VGG16 and the ResNet50V2, respectively.…”
Section: A Experiments Setupmentioning
confidence: 99%
“…to make use of the general observations that it outperforms, or matches, the widely used ReLU activation function as a result of its improved smoothness and differentiability properties [53,54].…”
Section: Training and Adaptive Sampling Of Training Datamentioning
confidence: 99%
“…The parametric swish (p-swish) [429] is another AF proposed by Mercioni and Holban. It is defined as…”
Section: Parametric Swish (P-swish)mentioning
confidence: 99%