2022
DOI: 10.3389/fpubh.2022.819865
|View full text |Cite
|
Sign up to set email alerts
|

A Multistage Heterogeneous Stacking Ensemble Model for Augmented Infant Cry Classification

Abstract: Understanding the reason for an infant's cry is the most difficult thing for parents. There might be various reasons behind the baby's cry. It may be due to hunger, pain, sleep, or diaper-related problems. The key concept behind identifying the reason behind the infant's cry is mainly based on the varying patterns of the crying audio. The audio file comprises many features, which are highly important in classifying the results. It is important to convert the audio signals into the required spectrograms. In thi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(7 citation statements)
references
References 31 publications
0
7
0
Order By: Relevance
“…Researchers conducted a study on classifying infant crying into four categories: hunger, pain, tiredness, and diaper (Joshi et al, 2022 ). They first preprocessed the signals and converted them into Mel-spectrograms.…”
Section: Related Workmentioning
confidence: 99%
“…Researchers conducted a study on classifying infant crying into four categories: hunger, pain, tiredness, and diaper (Joshi et al, 2022 ). They first preprocessed the signals and converted them into Mel-spectrograms.…”
Section: Related Workmentioning
confidence: 99%
“…Most researchers have adopted the cepstral domain features in the feature extraction from audio signals such as Mel frequency cepstral coefficients (MFCC) [ 33 , 34 , 35 , 36 ], linear frequency cepstral coefficients (LFCC) [ 37 ], short-time cepstral coefficients (STCC) [ 37 ], and Bark frequency cepstral coefficients (BFCC) [ 38 ], combined with both DL and traditional ML models. MFCCs were the most used in identifying infant pathologies.…”
Section: Literature Reviewmentioning
confidence: 99%
“…A similar feature extraction was also used along with KNN in [ 35 ] and achieved an accuracy of 71.42% in determining the reason for crying, including hunger, belly pain, need for burping, discomfort, and tiredness. In [ 36 ], MFCC was used with the CNN model with multiple variants to test and multistage a heterogeneous stacking ensemble model, which consists of four levels of algorithms, Nu-support vector classification, random forest (RF), XGBoost, and AdaBoost. The classification results of the CNN model outperformed the other ML algorithms, reaching an accuracy of 93.7%.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Ting et al 21 classified asphyxia infant cry using hybrid speech features and CNN. Joshi et al 22 proposed a multistage heterogeneous ensemble model for augmented infant cry classification. Initially, the mel‐frequency cepstral coefficients algorithm was used to generate the spectrograms and to analyze the varying feature vectors.…”
Section: Literature Reviewmentioning
confidence: 99%