Background:
Autistic Spectrum Disorder (ASD) is a disorder associated with genetic and
neurological components leading to difficulties in social interaction and communication. According
to statistics of WHO, the number of patients diagnosed with ASD is gradually increasing. Most of
the current studies focus on clinical diagnosis, data collection and brain images analysis, but do not
focus on the diagnosis of ASD based on machine learning.
Objective:
This study aims to classify ASD data to provide a quick, accessible and easy way to support
early diagnosis of ASD.
Methods:
Three ASD datasets are used for children, adolescences and adults. To classify the ASD
data, we used the k-Nearest Neighbours method (kNN), the Support Vector Machine method (SVM)
and the Random Forests method (RF). In our experiments, the data was randomly split into training
and test sets. The parts of the data were randomly selected 100 times to test the classification methods.
Results:
The final results were assessed by the average values. It is shown that SVM and RF are
effective methods for ASD classification. In particular, the RF method classified the data with an
accuracy of 100% for all above datasets.
Conclusion:
The early diagnosis of ASD is critical. If the number of data samples is large enough,
we can achieve a high accuracy for machine learning-based ASD diagnosis. Among three classification
methods, RF achieves the best performance for ASD data classification.
Background:
The modern society is extremely prone to many life-threatening diseases, which can be easily controlled as well as cured if diagnosed at an early stage. The development and implementation of a disease diagnostic system have gained huge popularity over the years. In the current scenario, there are certain factors such as environment, sedentary lifestyle, genetic (hereditary) are the major factors behind the life threatening disease such as, ‘diabetes’. Moreover, diabetes has achieved the status of the modern man’s leading chronic disease. So one of the prime need of this generation is to develop a state-of-the-art expert system which can predict diabetes at a very early stage with a minimum of complexity and in an expedited manner. The primary objective of this research work is to develop an indigenous and efficient diagnostic technique for detection of the diabetes.
Method & Discussion:
The proposed methodology comprises of two phases: In the first phase The Pima Indian Diabetes Dataset (PIDD) has been collected from the UCI machine learning repository databases and Localized Diabetes Dataset (LDD) has been gathered from Bombay Medical Hall, Upper Bazar Ranchi, Jharkhand, India. In the second phase, the dataset has been processed through two different approaches. The first approach entails classification through Adaboost, Classification Via Regression (CVR), Radial Basis Function Network (RBFN), K-Nearest Neighbor (KNN) on Pima Indian Diabetes Dataset and Localized Diabetes Dataset. In the second approach, Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA) have been applied as a feature reduction method followed by using the same set of classification methods used in the first approach. Among all of the implemented classification method, PCA_CVR performs the highest performance for both the above mentioned dataset.
Conclusion:
In this research article, comparative analysis of outcomes obtained by with and without the use of PCA and LDA for the same set of classification method has been done w.r.t performance assessment. Finally, it has been concluded that PCA & LDA both is useful to remove the insignificant features, decreasing the expense and computation time while improving the ROC and accuracy. The used methodology may similarly be applied in other medical diseases.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.