2019 International Conference on Systems, Signals and Image Processing (IWSSIP) 2019
DOI: 10.1109/iwssip.2019.8787318
|View full text |Cite
|
Sign up to set email alerts
|

Identification of Infants’ Cry Motivation Using Spectrograms

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 28 publications
(14 citation statements)
references
References 17 publications
0
14
0
Order By: Relevance
“…We found that several studies that explored the use of minimum, maximum, mean, standard deviation and the variance of MFCCs and other audio features to differentiate normal, hypo-acoustic and asphyxia types using the Chillanto database (6). Support Vector Machines (SVM) are among the most popular infant classification algorithms and routinely outperform neural network classifiers (22,23). Furthermore, Osmani et al have illustrated that boosted and bagging trees outperform SVM cry classification (24).…”
Section: Discussionmentioning
confidence: 99%
“…We found that several studies that explored the use of minimum, maximum, mean, standard deviation and the variance of MFCCs and other audio features to differentiate normal, hypo-acoustic and asphyxia types using the Chillanto database (6). Support Vector Machines (SVM) are among the most popular infant classification algorithms and routinely outperform neural network classifiers (22,23). Furthermore, Osmani et al have illustrated that boosted and bagging trees outperform SVM cry classification (24).…”
Section: Discussionmentioning
confidence: 99%
“…Zhang et al created new waveform images from training datasets by transforming these waveform images into slightly faster or slightly slower waveforms for the purpose of increasing training datasets to overcome overfitting problem [12]. In [43], several data augmentation techniques, such as noise variation, signal intensity variation, tonality variation, and spectrogram's size alteration, were used to artificially increase either the number of audio signals or the number of spectrograms. The experimental results showed that these data augmentation methods cannot lead to accuracy improvement.…”
Section: Data Acquisitionmentioning
confidence: 99%
“…In [24], Singh et al explored the residual MFCC and implicit LP residual features that represent excitation source information. Researchers have also tried other cepstral features such as Fast Fourier Transform (FFT) [23,66], Log-Mel feature [11,18], Mel Scale [43], Constant-Q Chromagram [43], Log-mel spectrum [12], and delta spectrum [12]. According to auditory perception models, MFCC coefficients are more robust than other coefficients such as LPC coefficients.…”
Section: Cepstral Domain Featuresmentioning
confidence: 99%
See 2 more Smart Citations