2016 2nd International Conference on Advanced Technologies for Signal and Image Processing (ATSIP) 2016
DOI: 10.1109/atsip.2016.7523111
|View full text |Cite
|
Sign up to set email alerts
|

Towards a computer tool for automatic detection of laryngeal cancer

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
3
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(4 citation statements)
references
References 8 publications
0
3
0
Order By: Relevance
“…As dysphonia is often the first symptom in LC, many authors have attempted to use ML to detect LC based on voice recordings (Table 4). While previous studies used extracted acoustic features and perturbation measures to differentiate voices of patients with LC from healthy voices, 27 more recent technology converts the acoustic data into a visual Mel spectrogram (a visual representation of the frequency spectrum of an acoustic signal) and feeds an image to the ML model to make such distinction 28 . In a recent study, Kim et al showed accuracy as high as 85% in detecting pathologic voice changes in LC by using voice samples converted to Mel spectrograms 28 .…”
Section: Discussionmentioning
confidence: 99%
“…As dysphonia is often the first symptom in LC, many authors have attempted to use ML to detect LC based on voice recordings (Table 4). While previous studies used extracted acoustic features and perturbation measures to differentiate voices of patients with LC from healthy voices, 27 more recent technology converts the acoustic data into a visual Mel spectrogram (a visual representation of the frequency spectrum of an acoustic signal) and feeds an image to the ML model to make such distinction 28 . In a recent study, Kim et al showed accuracy as high as 85% in detecting pathologic voice changes in LC by using voice samples converted to Mel spectrograms 28 .…”
Section: Discussionmentioning
confidence: 99%
“…Several ML methods using vocal recordings perform binary classification to distinguish voices from patients with laryngeal cancer from those with healthy voices or benign voice disorders with accuracy ranging from 85.2% to 98% [81][82][83][84][85][86][87][88]89 ], which is derived from a transformation of the audio signal and provides a compact representation of the spectral properties of a sound. Others algorithms rely on acoustic features (jitter, shimmer, and harmonic features) [81,88] and glottal air-flow parameters [84,86]. Studies in this category use various preprocessing, feature extraction, and classifications methods, including different neural network methods [81-84,86,87,89 & ], support vector machine [81,85], hidden Markov models [88], and Gaussian mixture models [85].…”
Section: Machine Learning Models Utilizing Voice and Speech To Screen...mentioning
confidence: 99%
“…Within this frame, the present research project was designed to test a novel COVID-19 screening tool based on voice analysis through machine learning (ML). Conventional voice analysis proved useful in detecting distinguishing acoustic features of pathologies impairing all structures and systems responsible for phonation, including lungs [17][18][19], trachea [20], larynx [21][22][23], vocal folds [24,25] and central nervous system [26][27][28][29][30]. Furthermore, encouraging results had been obtained for disorders impairing voice production mechanisms only secondarily, including cardiovascular diseases [31][32][33][34] and diabetes [35].…”
Section: Introductionmentioning
confidence: 99%