Interspeech 2021 2021
DOI: 10.21437/interspeech.2021-1798
|View full text |Cite
|
Sign up to set email alerts
|

Transfer Learning and Data Augmentation Techniques to the COVID-19 Identification Tasks in ComParE 2021

Abstract: In this work, we propose several techniques to address data scarceness in ComParE 2021 COVID-19 identification tasks for the application of deep models such as Convolutional Neural Networks. Data is initially preprocessed into spectrogram or MFCC-gram formats. After preprocessing, we combine three different data augmentation techniques to be applied in model training. Then we employ transfer learning techniques from pretrained audio neural networks. Those techniques are applied to several distinct neural archi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
16
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
3
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 15 publications
(18 citation statements)
references
References 0 publications
0
16
0
Order By: Relevance
“…Coppock et al (2022) presented a summary of the INTERSPEECH 2021 ComParE. A cough and speech UAR of 75.9% (Casanova et al, 2021) and 72.1% (Schuller et al, 2021) was achieved, respectively. Although we reached a slightly lower UAR for cough (UAR = 75.54%), it is worth noting that we did not use any data augmentation and deep learning methods.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…Coppock et al (2022) presented a summary of the INTERSPEECH 2021 ComParE. A cough and speech UAR of 75.9% (Casanova et al, 2021) and 72.1% (Schuller et al, 2021) was achieved, respectively. Although we reached a slightly lower UAR for cough (UAR = 75.54%), it is worth noting that we did not use any data augmentation and deep learning methods.…”
Section: Discussionmentioning
confidence: 99%
“…Unlike for cough, we did not achieve a good performance for speech tasks compared to the baseline shown by Schuller et al (2021) (UAR = 72.1%). Casanova et al (2021), when exploring the same approach utilized for cough, had achieved a UAR of 70.3%. Klumpp et al (2021) explored Mel spectrograms and various classifiers, such as LSTM, CNN, SVM, and LR, with data augmentation, and a UAR of 64.2% was reached.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…in [13,14,15]. In [13], the authors proposed an ensemble of CNN classifiers from different acoustic features.…”
Section: Related Workmentioning
confidence: 99%
“…Ko et al [4] evaluated a low-cost data augmentation technique by speed perturbation and found it improved the word error rate (WER) over other methods. Casanova et al [5] employed both transfer learning and data augmentation for improving COVID-19 detection from cough sounds. Similarly, data augmentation could improve SER performances in specific ways.…”
Section: Introductionmentioning
confidence: 99%