Cancer has been known as a disease consisting of several different types. Cancer is a life threatening disease in the world today. There are so many types of cancer in the world, one of which is colon cancer. Colon cancer is one of the number one killers in the world. However, because there isn’t any obvious symptom of colon cancer at an early stage, people do not realize that they suffer from it. Even though cancer formation is different for each type of cancer, it is still a big challenge to make cancer classification with good accuracy. Many machine learning has been applied to the data of human’s genes in order to get the most relevant genes in the classification of cancer. The author proposes the Naïve Bayes Classifier model as a classification method to show that the model has good accuracy, good precision, good recall, good f
1 — score in classifying the data of patients suffering from colon cancer or not. In this proposed model, Naïve Bayes Classifier is a technique prediction based on simple probabilistic and on the application of the Bayes theorem (or Bayes rule) with a strong independence assumption. Therefore, this model is able to make higher classification accuracy with less complexity. In particular, it achieves up to 95.24% classification accuracy, thus this model can be an efficient analysis tool.
Prostate cancer is cancer that attacks the prostate gland, usually affecting men over 50 years. Prostate cancer is a disease that develops slowly. Based on this, rapid and precise detection is needed so that the disease can be treated immediately. This study focuses on the application Feature Selection using the Random Forest Classifier to detect prostate cancer. The Random Forest Classifier is a method of classifying data by determining the decision tree. The use of more trees will affect the accuracy to be obtained for the better. The Random Forest Classifier can classify data that has incomplete attributes and can be used to handle large sample data. Selection of features is an important process because it can affect the accuracy of classification. This method increases accuracy by about 87%. Thus, the selection of features can improve accuracy in the detection of prostate cancer.
There are more than 100 types of cancer around the world with different symptoms and difficulty in predicting its appearance in a person due to its random and sudden attack method. However, the appearance of cancer is generally marked by the growth of some abnormal cell. Someone might be diagnosed early and quickly treated, but the cancerous cell most times hides in the body of its victim and reappear, only to kill its sufferer. One of the most common cancers is breast cancer. According to Ministry of Health, in 2018, breast cancer attacked 42 out of every 100.000 people in Indonesia with approximately 17 deaths. In addition, the Ministry recorded a yearly increase in cancer patients. Therefore, there is adequate need to be able to determine those affected by this disease. This study applied the Boruta feature selection to determine the most important features in making a machine learning model. Furthermore, the Random Forest (RF) and Support Vector Machines (SVM) were the machine learning model used, with highest accuracies of 90% and 95% respectively. From the results obtained, the SVM is a better model than random forest in terms of accuracy.
Acute sinusitis is an inflammation of the sinus which causes the cavity around the sinus to swells due to accumulated mucus. It makes the patient experience difficulty in breathing through the nose. Generally, it is caused by the common cold, and in most cases, the patient recovers within seven to ten days. However, persistent acute sinusitis can cause severe infections and other complications. Therefore, it requires timely detection and more accurate method of classification. Many techniques have been used to classify acute sinusitis but, in this study, the machine learning methods which includes Kernel Spherical K-Means (KSPKM) and Support Vector Machine (SVM) was applied. SPKM is the application of K-Means, in this research, it was modified by changing the inner product with kernel function to ensure linear data separation on higher dimensions for the maximization of SPKM performance. The SVM is a binary classification method that helps to create a model with good generalization ability. We used CT scan result data from RSCM, Central Jakarta. Simulations were performed with different percentage of training data. The results were compared in terms of Accuracy and Running Time. The score showed that the performance of KSPKM attained an accuracy rate of 97%, while SVM reached 90%.
Early detection of pancreatic cancer is difficult, and thus many cases of pancreatic cancer are diagnosed late. When pancreatic cancer is detected, the cancer is usually well developed. Machine learning is an approach that is part of artificial intelligence and can detect pancreatic cancer early. This paper proposes a machine learning approach with the twin support vector machine (TWSVM) method as a new approach to detecting pancreatic cancer early. TWSVM aims to find two symmetry planes such that each plane has a distance close to one data class and as far as possible from another data class. TWSVM is fast in building a model and has good generalizations. However, TWSVM requires kernel functions to operate in the feature space. The kernel functions commonly used are the linear kernel, polynomial kernel, and radial basis function (RBF) kernel. This paper uses the TWSVM method with these kernels and compares the best kernel for use by TWSVM to detect pancreatic cancer early. In this paper, the TWSVM model with each kernel is evaluated using a 10-fold cross validation. The results obtained are that TWSVM based on the kernel is able to detect pancreatic cancer with good performance. However, the best kernel obtained is the RBF kernel, which produces an accuracy of 98%, a sensitivity of 97%, a specificity of 100%, and a running time of around 1.3408 s.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.