2019
DOI: 10.1016/j.jvoice.2018.02.003
|View full text |Cite
|
Sign up to set email alerts
|

Detection of Pathological Voice Using Cepstrum Vectors: A Deep Learning Approach

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

2
90
0
3

Year Published

2019
2019
2024
2024

Publication Types

Select...
8
1
1

Relationship

1
9

Authors

Journals

citations
Cited by 171 publications
(103 citation statements)
references
References 19 publications
2
90
0
3
Order By: Relevance
“…In addition, a recent study confirms that a deep neural network model can be used to detect voice disorders with high accuracy based on voice samples [40]. Meanwhile, it has been confirmed that the ANN can incorporate heterogeneous data to achieve better classification and regression performance [41, 42].…”
Section: Discussionmentioning
confidence: 99%
“…In addition, a recent study confirms that a deep neural network model can be used to detect voice disorders with high accuracy based on voice samples [40]. Meanwhile, it has been confirmed that the ANN can incorporate heterogeneous data to achieve better classification and regression performance [41, 42].…”
Section: Discussionmentioning
confidence: 99%
“…Objective metrics obtained using various acoustic instruments have been investigated, and attempts have been made to correlate these with perceptual voice quality assessments [8][9][10][11][12].A plethora of temporal, spectral, and cepstral metrics have been proposed to evaluate voice quality [13,14]. Commonly used features or vocal metrics include fundamental frequency ( f 0), loudness, jitter, shimmer, vocal formants, harmonic-to-noise ratio (HNR), spectral tilt (H1-H2, harmonic richness factor), maximum flow declination rate (MFDR), duty ratio, cepstral peak prominence (CPP), Mel-frequency cepstral coefficients (MFCCs), power spectrum ratio, and others [15][16][17][18][19]. Self-reported feelings of decreased vocal functionality have been used as a criterion for vocal fatigue in many previous studies [1,4,[20][21][22].…”
mentioning
confidence: 99%
“…A subset of the corpus previously described in [4] has been supplied by the organizers of the challenge. The provided dataset has been divided into a training and testing partition for the purposes of performance evaluation.…”
Section: A Corpusmentioning
confidence: 99%