2022
DOI: 10.1109/access.2022.3219606
|View full text |Cite
|
Sign up to set email alerts
|

RS-MSConvNet: A Novel End-to-End Pathological Voice Detection Model

Abstract: Recent studies have reported the success of multi-scale convolution neural network (MSConvNet) model for many classification applications due to its powerful ability of exploring multiscale convolution block to extract multi-scale representations to make a detection. However, a new design based on MSConvNet for pathological voice detection has not been explored. In this paper, we propose RS-MSConvNet, a novel end-to-end MSConvNet model using raw speech for pathological voice detection. The main contribution of… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 45 publications
0
2
0
Order By: Relevance
“…In future work, we aim to investigate other attention mechanisms to further refine our proposed methods. We also plan to incorporate multi-scale convolutional neural networks [35] and phase information [49], [65] as supplementary data to enhance our methodologies. MFCCs based on EMD KNN 88.40 [5] MFCCs based on EMD MLP 88.80 [5] MFCCs based on EMD SVM 96.20 [5] MFCCs based on EMD DNN 98.90 [55] FDPCT VGG16-based transfer learning 94.00 [55] FDPCT ResNet-50-based transfer learning 98.00 [55] FDPCT Deep CNN 99.48 [30] Chirplet transform MCC 98.33 [47] log-mel spectrogram LSTM 95.00 [47] log-mel spectrogram CNN 99.67 [48] LPCC, MFCC SFF-HLSTM 99.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…In future work, we aim to investigate other attention mechanisms to further refine our proposed methods. We also plan to incorporate multi-scale convolutional neural networks [35] and phase information [49], [65] as supplementary data to enhance our methodologies. MFCCs based on EMD KNN 88.40 [5] MFCCs based on EMD MLP 88.80 [5] MFCCs based on EMD SVM 96.20 [5] MFCCs based on EMD DNN 98.90 [55] FDPCT VGG16-based transfer learning 94.00 [55] FDPCT ResNet-50-based transfer learning 98.00 [55] FDPCT Deep CNN 99.48 [30] Chirplet transform MCC 98.33 [47] log-mel spectrogram LSTM 95.00 [47] log-mel spectrogram CNN 99.67 [48] LPCC, MFCC SFF-HLSTM 99.…”
Section: Discussionmentioning
confidence: 99%
“…In ad-dition, the study examined various machine learning classifiers, including the support vector machine (SVM) [22]- [27], multiclass composite classifiers (MCC) [28], Multilayer Perceptron (MLP) [29], Random Forest (RF) [30], and k-Nearest Neighbor (k-NN) [31]- [34]. The effectiveness of hand-crafted feature extraction for classification is a significant part of these conventional approaches, demonstrating the need for expertise, e.g., speech processing tasks [35].…”
Section: Referencementioning
confidence: 99%