6th International Conference on Sustainable Information Engineering and Technology 2021
DOI: 10.1145/3479645.3479675

Automatic Multiscale-based Peak Detection on Short Time Energy and Spectral Centroid Feature Extraction for Conversational Speech Segmentation

Cited by 4 publications (4 citation statements)
References 11 publications

“…The short-time average zero crossing rate refers to the number of times the signal crosses the zero value in each frame, which can reflect the frequency spectral characteristics to a certain extent, and is a kind of sound signal time-domain feature often used in speech endpoint detection [14]. As the bowel sound signals vary in strength, it is difficult to see obvious changes in the STE alone for the sudden and weaker bowel sounds, while their short-time average zero crossing rate is usually higher, which can be used as one of the features to analyze the bowel sounds.…”
Section: Zero Crossing Rate (ZCR); citation type: mentioning
confidence: 99%
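
The contrast drawn in the statement above (weak sounds barely register in short-time energy but still show a high zero crossing count) can be illustrated with a minimal sketch. It is not taken from the cited or citing papers: the function names, the frame length, and the convention of treating zero as non-negative are all assumptions made for this example.

```python
def frame_signal(x, frame_len, hop):
    """Split a sequence into overlapping frames; a trailing partial frame is dropped."""
    return [x[i:i + frame_len] for i in range(0, len(x) - frame_len + 1, hop)]


def short_time_energy(frame):
    """Short-time energy (STE): sum of squared samples in one frame."""
    return sum(s * s for s in frame)


def zero_crossing_count(frame):
    """Number of sign changes between consecutive samples in one frame.

    Assumption: a sample equal to zero is treated as non-negative.
    """
    return sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))


# Hypothetical frames: a weak, rapidly alternating one and a loud, slowly varying one.
weak_fast = [0.1, -0.1] * 4
loud_slow = [1.0] * 4 + [-1.0] * 4

for name, frame in (("weak_fast", weak_fast), ("loud_slow", loud_slow)):
    print(name, short_time_energy(frame), zero_crossing_count(frame))
```

In this toy case the weak frame has roughly one hundredth of the energy of the loud frame but seven times as many zero crossings, which is why the two features are often used together.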
“…In cognizance of this, our research adopts a comprehensive approach by considering both frequency domain features and time domain features inherent in neonatal bowel sounds. We have strategically extracted MFCC [12], Short Time Energy (STE) [13], and Zero Crossing Rate (ZCR) [14] as integral components of our feature extraction methodology. These features collectively encapsulate the nuanced characteristics of neonatal bowel sounds.…”
Section: Introduction; citation type: mentioning
confidence: 99%
“…For the low-level acoustic features, we calculated two typical acoustic features: mel frequency cepstral coefficients (MFCCs) (Grama and Rusu, 2017) and Spectral Centroid (Prasetio et al, 2021).…”
Section: Exploration Of Relationship Between High-level Semantic Feat...; citation type: mentioning
confidence: 99%
“…where f is the actual measured frequency, Mel(f) is the Mel-scale, 2,595 and 700 are the commonly used constants in Mel-scale formula. Spectral Centroid is one of the important physical parameters describing the properties of timbre, which indicates where the centroid of the spectrum is located (Prasetio et al, 2021). Generally, the audios with dark and deep quality tend to have more low-frequency components and relatively low Spectral Centroid, while the audios with bright and cheerful quality mostly concentrate on high frequency and relatively high Spectral Centroid.…”
Section: Exploration Of Relationship Between High-level Semantic Feat...; citation type: mentioning
confidence: 99%
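
The two quantities in the statement above can be sketched in a few lines. The Mel conversion uses the common form Mel(f) = 2595 · log10(1 + f / 700), which matches the constants quoted. The spectral centroid is computed here as the magnitude-weighted mean frequency of a frame; that weighting, the naive DFT, and all function names are assumptions for illustration, since the quoted text does not give the exact definition used by either paper.

```python
import cmath
import math


def hz_to_mel(f_hz):
    """Common Mel-scale conversion using the constants 2595 and 700."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def magnitude_spectrum(frame):
    """Naive DFT magnitudes for bins 0..N/2 (adequate for short frames only)."""
    n = len(frame)
    return [
        abs(sum(frame[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n)))
        for k in range(n // 2 + 1)
    ]


def spectral_centroid(frame, sample_rate):
    """Magnitude-weighted mean frequency in Hz (assumed definition)."""
    n = len(frame)
    mags = magnitude_spectrum(frame)
    total = sum(mags)
    if total == 0:
        return 0.0  # silent frame: centroid is undefined, return 0 by convention
    freqs = [k * sample_rate / n for k in range(len(mags))]
    return sum(f * m for f, m in zip(freqs, mags)) / total


# Hypothetical test tones: 16 samples at 16 Hz, so DFT bin k corresponds to k Hz.
n, sr = 16, 16
dark = [math.sin(2 * math.pi * 2 * t / n) for t in range(n)]    # 2 Hz tone
bright = [math.sin(2 * math.pi * 6 * t / n) for t in range(n)]  # 6 Hz tone

print(hz_to_mel(1000.0))
print(spectral_centroid(dark, sr), spectral_centroid(bright, sr))
```

The lower tone yields the lower centroid, consistent with the description of "dark" versus "bright" audio in the quoted statement.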