Interspeech 2021 2021
DOI: 10.21437/interspeech.2021-1679
|View full text |Cite
|
Sign up to set email alerts
|

On-the-Fly Aligned Data Augmentation for Sequence-to-Sequence ASR

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
9
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 15 publications
(10 citation statements)
references
References 0 publications
0
9
0
Order By: Relevance
“…Third, AlexNet and VGGNet architecture have some inspirable benefits compared to other CNN architectures such as easy GoogleNet, ResNet, DenseNet, and especially less parameters are required to train the models and result is lightweight of model. Fourth, because of using some diverse mechanisms such as early stopping [ 59 ] that helps to stop over-fitting of the models, weighted random sampler [ 58 ] for reducing the class imbalance problem of the samples, Adam optimizer [ 38 ] for handling the minimum validation loss and quick training, on-the-fly augmentation [ 61 ], stratified evaluation strategy for ensuring the samples from each class to each fold, and reducing the effect of the class imbalance problem [ 62 , 63 ].…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…Third, AlexNet and VGGNet architecture have some inspirable benefits compared to other CNN architectures such as easy GoogleNet, ResNet, DenseNet, and especially less parameters are required to train the models and result is lightweight of model. Fourth, because of using some diverse mechanisms such as early stopping [ 59 ] that helps to stop over-fitting of the models, weighted random sampler [ 58 ] for reducing the class imbalance problem of the samples, Adam optimizer [ 38 ] for handling the minimum validation loss and quick training, on-the-fly augmentation [ 61 ], stratified evaluation strategy for ensuring the samples from each class to each fold, and reducing the effect of the class imbalance problem [ 62 , 63 ].…”
Section: Resultsmentioning
confidence: 99%
“…Most data in medicals and clinics are normal and only a few numbers of data are abnormal. Some anterior arrhythmia works performed augmentation manually but herein we have performed online augmentation on-the-fly [ 61 ] of images. The major benefits of this concept are hassle-free and time-saving unlike manual augmentation.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“… The proposed model expresses model generalization because it was tested on four datasets without changing any hyperparameters, and the model architecture and results are consistent. This achievement is due to the use of some diverse regularization strategies: batch normalization (BN) [ 45 ], call-back features [ 46 ], weighted random sampler [ 47 ], Adam optimizer [ 48 ], on-the-fly augmentation [ 49 ], and appropriate initialization of layers [ 50 ] of the model in the method. …”
Section: Introductionmentioning
confidence: 99%