Dementia is a degenerative disease that is increasingly prevalent in an aging society. Alzheimer’s disease (AD), the most common type of dementia, is best mitigated via early detection and management. Deep learning is an artificial intelligence technique that has been used to diagnose and predict diseases by extracting meaningful features from medical images. The convolutional neural network (CNN) is a representative application of deep learning, serving as a powerful tool for the diagnosis of AD. Recently, vision transformers (ViT) have yielded classification performance exceeding that of CNN in some diagnostic image classifications. Because the brain is a very complex network with interrelated regions, ViT, which captures direct relationships between images, may be more effective for brain image analysis than CNN. Therefore, we propose a method for classifying dementia images by applying 18F-Florbetaben positron emission tomography (PET) images to ViT. Data were evaluated via binary (normal control and abnormal) and ternary (healthy control, mild cognitive impairment, and AD) classification. In a performance comparison with the CNN, VGG19 was selected as the comparison model. Consequently, ViT yielded more effective performance than VGG19 in binary classification. However, in ternary classification, the performance of ViT cannot be considered excellent. These results show that it is hard to argue that the ViT model is better at AD classification than the CNN model.
18F-florbetaben (FBB) positron emission tomography is a representative imaging test that observes amyloid deposition in the brain. Compared to delay-phase FBB (dFBB), early-phase FBB shows patterns related to glucose metabolism in 18F-fluorodeoxyglucose perfusion images. The purpose of this study is to prove that classification accuracy is higher when using dual-phase FBB (dual FBB) versus dFBB quantitative analysis by using machine learning and to find an optimal machine learning model suitable for dual FBB quantitative analysis data. The key features of our method are (1) a feature ranking method for each phase of FBB with a cross-validated F1 score and (2) a quantitative diagnostic model based on machine learning methods. We compared four classification models: support vector machine, naïve Bayes, logistic regression, and random forest (RF). In composite standardized uptake value ratio, RF achieved the best performance (F1: 78.06%) with dual FBB, which was 4.83% higher than the result with dFBB. In conclusion, regardless of the two quantitative analysis methods, using the dual FBB has a higher classification accuracy than using the dFBB. The RF model is the machine learning model that best classifies a dual FBB. The regions that have the greatest influence on the classification of dual FBB are the frontal and temporal lobes.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.