2021
DOI: 10.3389/fcomp.2021.624659
|View full text |Cite
|
Sign up to set email alerts
|

Learning Language and Acoustic Models for Identifying Alzheimer’s Dementia From Speech

Abstract: Alzheimer’s dementia (AD) is a chronic neurodegenerative illness that manifests in a gradual decline of cognitive function. Early identification of AD is essential for managing the ensuing cognitive deficits, which may lead to a better prognostic outcome. Speech data can serve as a window into cognitive functioning and can be used to screen for early signs of AD. This paper describes methods for learning models using speech samples from the DementiaBank database, for identifying which subjects have Alzheimer’s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
27
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 27 publications
(30 citation statements)
references
References 19 publications
0
27
0
Order By: Relevance
“…Regarding BERT + ViT + Co-Attention, it improves the RMSE scores of all the existing research initiatives, except Bimodal Network (Ensembled Output) (Koo et al, 2020 ), by 0.14–1.41. In terms of the Multimodal BERT - eGeMAPS, Multimodal BERT - ViT, and Multimodal BERT - eGeMAPS + ViT, it seems that these architectures are rather complex for the MMSE regression task improving the RMSE score of only one research work (Shah et al, 2021 ).…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Regarding BERT + ViT + Co-Attention, it improves the RMSE scores of all the existing research initiatives, except Bimodal Network (Ensembled Output) (Koo et al, 2020 ), by 0.14–1.41. In terms of the Multimodal BERT - eGeMAPS, Multimodal BERT - ViT, and Multimodal BERT - eGeMAPS + ViT, it seems that these architectures are rather complex for the MMSE regression task improving the RMSE score of only one research work (Shah et al, 2021 ).…”
Section: Resultsmentioning
confidence: 99%
“…Shah et al ( 2021 ) used also an ensemble method to predict AD patients. Specifically, after training acoustic and language models, they chose the three best performing acoustic models and the best performing language model.…”
Section: Related Workmentioning
confidence: 99%
“…Table 3 shows the ranked features from both feature sets, together with their Pearson's correlation (r) with the diagnosis class; due to space limitations, we only show the top 10 features. 2 The most significant acoustic feature was LogHNR, known to be important in acoustic analysis for the diagnosis of pathological voices; loudness, raw fundamental frequency, variation in jitter, intensity, and LogHNR all positively correlate with AD and have been reported as useful features in literature for Dementia [31,9]. Among interactional features, lapses are positively correlated with AD, indicating that patients find trouble continuing topics, resulting in delays with interviewers initiating a new topic.…”
Section: Resultsmentioning
confidence: 99%
“…Its highest incidence is among adults due to age as a risk factor: one in every six individuals over the age of 80 is likely to develop AD and the number of cases over the age of 60 is doubling every 45 years [1]. Early recognition of cognitive decline could be helpful in managing pre-stage AD thus allowing better quality of life for elderly patients and their caregivers [2].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation