2021
DOI: 10.3389/fcomp.2020.624488
|View full text |Cite
|
Sign up to set email alerts
|

Pauses for Detection of Alzheimer’s Disease

Abstract: Pauses, disfluencies and language problems in Alzheimer’s disease can be naturally modeled by fine-tuning Transformer-based pre-trained language models such as BERT and ERNIE. Using this method with pause-encoded transcripts, we achieved 89.6% accuracy on the test set of the ADReSS (Alzheimer’s Dementia Recognition through Spontaneous Speech) Challenge. The best accuracy was obtained with ERNIE, plus an encoding of pauses. Robustness is a challenge for large models and small training sets. Ensemble over many r… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
20
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 18 publications
(22 citation statements)
references
References 37 publications
1
20
0
Order By: Relevance
“…They obtained 90% accuracy and demonstrated the importance of dysfluency and pauses in detecting AD. The champion of the ADReSS challenge ( 10 ) combined deep learning with pauses and obtained SOTA accuracy of 89.6%, proving that pauses are important for AD diagnosis. Sadeghian et al ( 41 ) extracted acoustic features, including pauses more than 5 s in duration, and obtained the best accuracy of 95.8%.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations
“…They obtained 90% accuracy and demonstrated the importance of dysfluency and pauses in detecting AD. The champion of the ADReSS challenge ( 10 ) combined deep learning with pauses and obtained SOTA accuracy of 89.6%, proving that pauses are important for AD diagnosis. Sadeghian et al ( 41 ) extracted acoustic features, including pauses more than 5 s in duration, and obtained the best accuracy of 95.8%.…”
Section: Discussionmentioning
confidence: 99%
“…The parameters of the distilBert model are presented in Table 4 . The champion of the ADReSS challenge obtained an accuracy of 0.896 by combining the Enhanced Language Representation with Informative Entities (ERNIE) model ( 39 ) and pause information in speech using acoustic align technology ( 10 ). We achieved 88% accuracy on the test set, which is almost equivalent to the SOTA result, and a 13% improvement over the baseline of 75%, established by the organizers of ADReSS ( 9 ).…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…There are many more use cases for fine-tuning. We have used fine-tuning in speech for classifying emotions and Alzheimer's disease (Yuan et al 2021b;Yuan et al 2021a). Fine-tuning is likely to become one of the more popular methods for an extremely wide range of use cases in many fields including computational linguistics, speech, and vision.…”
Section: Discussionmentioning
confidence: 99%