2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2017
DOI: 10.1109/icassp.2017.7952700
|View full text |Cite
|
Sign up to set email alerts
|

Multi-view representation learning via gcca for multimodal analysis of Parkinson's disease

Abstract: Information from different bio-signals such as speech, handwriting, and gait have been used to monitor the state of Parkinson's disease (PD) patients, however, all the multimodal bio-signals may not always be available. We propose a method based on multi-view representation learning via generalized canonical correlation analysis (GCCA) for learning a representation of features extracted from handwriting and gait that can be used as a complement to speech-based features. Three different problems are addressed: … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
29
0
2

Year Published

2017
2017
2022
2022

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 30 publications
(31 citation statements)
references
References 15 publications
0
29
0
2
Order By: Relevance
“…The baseline features include articulation and prosody-based features, which are concatenated to form a 724-dimensional feature vector per utterance (Orozco-Arroyave, 2016;Vasquez-Correa et al, 2017). The articulationbased features includes 86 descriptors such as the energy content distributed in 22 Bark bands in the transition from voiced to unvoiced segments (22 descriptors), and from unvoiced to voiced segments (22 descriptors) OrozcoArroyave et al (2016).…”
Section: Prediction Of Laryngeal Fda Scoresmentioning
confidence: 99%
See 1 more Smart Citation
“…The baseline features include articulation and prosody-based features, which are concatenated to form a 724-dimensional feature vector per utterance (Orozco-Arroyave, 2016;Vasquez-Correa et al, 2017). The articulationbased features includes 86 descriptors such as the energy content distributed in 22 Bark bands in the transition from voiced to unvoiced segments (22 descriptors), and from unvoiced to voiced segments (22 descriptors) OrozcoArroyave et al (2016).…”
Section: Prediction Of Laryngeal Fda Scoresmentioning
confidence: 99%
“…All 50 PD speakers were considered in this evaluation. For the prediction task, we used the same Super Vector Regression as described by Vasquez-Correa et al (2017), using a leave-one-subjectout (LOSO) cross-validation. The performance is evaluated using the Spearman's correlation coefficient between the predicted scores and the real scores.…”
Section: Prediction Of Laryngeal Fda Scoresmentioning
confidence: 99%
“…CCA/GCCA has an impressive array of applications in data mining and machine learning, including clustering [4], regression [5], outlier detection [6], natural language processing and word embedding [7], [8], [9], speech processing [10], heath care data analytics [11], genetics [12], [13], [14] and many C.I. Kanatsoulis more.…”
Section: Introductionmentioning
confidence: 99%
“…Very recently, Fu et al [23] proposed an efficient way to handle the SUMCOR problem, which is the first large-scale SUMCOR algorithm that scales up to truly large views. Fu et al [7] have also considered another popular formulation of GCCA, namely, the MAX-VAR GCCA [24], [10], [11] and proposed highly scalable algorithms for it in [7].…”
Section: Introductionmentioning
confidence: 99%
“…There is still a lot of room for improvement, including a more objective scoring by PD speech specialists. Recent adoption of the Frenchay dysarthria Assessment (FDA) scale and the modified version (m-FDA) [7,8,9,10] have provided an alternative to the subjective UPDRS-III.1 score.…”
Section: Introductionmentioning
confidence: 99%