The purpose of this research is to demonstrate the ability of machine-learning (ML) methods for liver cancer classification using a fused dataset of two-dimensional (2D) computed tomography (CT) scans and magnetic resonance imaging (MRI). Datasets of benign (hepatocellular adenoma, hemangioma, cyst) and malignant (hepatocellular carcinoma, hepatoblastoma, metastasis) liver cancer were acquired at Bahawal Victoria Hospital (BVH), Bahawalpur, Pakistan. The final dataset was generated by fusion of 1200 (100 × 6 × 2) MR and CT-scan images, 200 (100 MRI and 100 CT-scan) images size 512 × 512 for each class of cancer. The acquired dataset was preprocessed by employing the Gabor filters to reduce the noise and taking an automated region of interest (ROIs) using an Otsu thresholding-based segmentation approach. The preprocessed dataset was used to acquire 254 hybrid-feature data for each ROI, which is the combination of the histogram, wavelet, co-occurrence, and run-length features, while 10 optimized hybrid features were selected by employing (probability of error plus average correlation) feature selection technique. For classification, we deployed this optimized hybrid-feature dataset to four ML classifiers: multilayer perceptron (MLP), support vector machine (SVM), random forest (RF), and J48, using a ten fold cross-validation method. MLP showed an overall accuracy of (95.78% on MRI and 97.44% on CT). Unfortunately, the obtained results were not promising, and there were some limitations due to the different modalities of the dataset. Thereafter, a fusion of MRI and CT-scan datasets generated the fused optimized hybrid-feature dataset. The MLP has shown a promising accuracy of 99% among all the deployed classifiers.
This study proposes the machine learning based classification of medical plant leaves. The total six varieties of medicinal plant leaves-based dataset are collected from the Department of Agriculture, The Islamia University of Bahawalpur, Pakistan. These plants are commonly named in English as (herbal) Tulsi, Peppermint, Bael, Lemon balm, Catnip, and Stevia and scientifically named in Latin as Ocimum sanctum, Mentha balsamea, Aegle marmelos, Melissa officinalis, Nepeta cataria, and Stevia rebaudiana, respectively. The multispectral and digital image dataset are collected via a computer vision laboratory setup. For the preprocessing step, we crop the region of the leaf and transform it into a gray level format. Secondly, we perform a seed intensity-based edge/line detection utilizing Sobel filter and draw five regions of observations. A total of 65 fused features dataset is extracted, being a combination of texture, run-length matrix, and multi-spectral features. For the feature optimization process, we employ a chi-square feature selection approach and select 14 optimized features. Finally, five machine learning classifiers named as a multi-layer perceptron, logit-boost, bagging, random forest, and simple logistic are deployed on an optimized medicinal plant leaves dataset, and it is observed that the multi-layer perceptron classifier shows a relatively promising accuracy of 99.01% as compared to the competition. The distinct classification accuracy by the multi-layer perceptron classifier on six medicinal plant leaves are 99.10% for Tulsi, 99.80% for Peppermint, 98.40% for Bael, 99.90% for Lemon balm, 98.40% for Catnip, and 99.20% for Stevia.
The purpose of this research is the segmentation of lungs computed tomography (CT) scan for the diagnosis of COVID-19 by using machine learning methods. Our dataset contains data from patients who are prone to the epidemic. It contains three types of lungs CT images (Normal, Pneumonia, collected from two different sources; the first one is the Radiology Department of Nishtar Hospital Multan and Civil Hospital Bahawalpur, Pakistan, and the second one is a publicly free available medical imaging database known as Radiopaedia. For the preprocessing, a novel fuzzy c-mean automated region-growing segmentation approach is deployed to take an automated region of interest (ROIs) and acquire 52 hybrid statistical features for each ROIs. Also, 12 optimized statistical features are selected via the chi-square feature reduction technique. For the classification, five machine learning classifiers named as deep learning J4, multilayer perceptron, support vector machine, random forest, and naive Bayes are deployed to optimize the hybrid statistical features dataset. It is observed that the deep learning J4 has promising results (sensitivity and specificity: 0.987; accuracy: 98.67%) among all the deployed classifiers. As a complementary study, a statistical work
The purpose of this study is the statistical analysis and discrimination of maize seed using a machine vision (MV) approach. The foundation of the digital image dataset holds six maize seed varieties named as Kargal K-9803, Gujjar Khan, Desi White, Pioner 30Y87, Syngenta ST-6142, and Pioner 31R88. The digital image dataset acquired via a digital imaging laboratory. For preprocessing, we crop the image into a size of 600×600 pixels, and convert it into a gray level image format. After that, line and edge detection are performed by using a Prewitt filter, and five non-overlapping areas of interest (AOIs) size of (200×200), and (250×250) are drawn. A total of 56 statistical features, containing texture features, histogram features, and spectral features, is extracted from each AOI. The 11 optimized statistical features have been selected by deploying “Correlation-based Feature Selection” (CFS) with the Greedy algorithm. For the discrimination analysis, four MV classifiers named as “Support Vector Machine” (SVM), “Logistic” (Lg), “Bagging” (B), and “LogitBoost” (LB) have been deployed on optimized statistical features dataset. After analysis, the SVM classifier has shown a promising accuracy of 99.93% on AOIs size (250×250). The obtained accuracy by SVM classifier on six maize seed varieties, namely Kargal K-9803, Gujjar Khan, Desi White, Pioner 30Y87, Syngenta ST-6142, and Pioner 31R88, were 99.9%, 99.8%, 100%, 100%, 99.9%, and 99.8%, respectively.
The purpose of this study is to discriminate sunflower seeds with the help of a dataset having spectral and textural features. The production of crop based on seed purity and quality other hand sunflower seed used for oil content worldwide. In this regard, the foundation of a dataset categorizes sunflower seed varieties (Syngenta CG, HS360, S278, HS30, Armani, and High Sun 33), which were acquired from the agricultural farms of The Islamia University of Bahawalpur, Pakistan, into six classes. For preprocessing, a new region-oriented seed-based segmentation was deployed for the automatic selection of regions and extraction of 53 multi-features from each region, while 11 optimized fused multi-features were selected using the chi-square feature selection technique. For discrimination, four supervised classifiers, namely, deep learning J4, support vector machine, random committee, and Bayes net, were employed to optimize the multi-feature dataset. We observe very promising accuracies of 98.2%, 97.5%, 96.6%, and 94.8%, respectively, when the size of a region is (180 × 180).
Background: Humans can deliver many emotions during a conversation. Facial expressions show information about emotions. Objectives: This study proposed a Machine Learning (ML) approach based on a statistical analysis of emotion recognition using facial expression through a digital image. Methodology: A total of 600 digital image datasets divided into 6 classes (Anger, Happy, Fear, Surprise, Sad, and Normal) was collected from publicly available Taiwan Facial Expression Images Database. In the first step, all images are converted into a gray level format and 4 Regions of Interest (ROIs) are created on each image, so the total image dataset gets divided in 2400 (600 x 4) sub-images. In the second step, 3 types of statistical features named texture, histogram, and binary feature are extracted from each ROIs. The third step is a statistical feature optimization using the best-first search algorithm. Lastly, an optimized statistical feature dataset is deployed on various ML classifiers. Results: The analysis part was divided into two phases: firstly boosting algorithms-based ML classifiers (named as LogitBoost, AdaboostM1, and Stacking) which obtained 94.11%, 92.15%, and 89.21% accuracy, respectively. Secondly, decision tree algorithms named J48, Random Forest, and Random Committee were obtained with 97.05%, 93.14%, and 92.15% accuracy, respectively. Conclusion: It was observed that decision tree based J48 classifiers gave 97.05% classification accuracy.
The purpose of this learning is to detect the Corn Seed Fusarium Disease using Hybrid Feature Space and Conventional machine learning (ML) approaches. A novel machine learning approach is employed for the classification of a total of six types of corn seed are collected which contain Infected Fusarium (moniliforme, graminearum, gibberella, verticillioides, kernel) as well as healthy corn seed, based on a multi-feature dataset, which is the grouping of geometric, texture and histogram features extracted from digital images. For each corn seed image, a total of twenty-five multi-features have been developed on every area of interest (AOI), sizes (50 × 50), (100 × 100), (150 × 150), and (200 × 200). A total of seven optimized features were selected by using a machine learning-based algorithm named “Correlation-based Feature Selection”. For experimentation, “Random forest”, “BayesNet” and “LogitBoost” have been employed using an optimized multi-feature user-supplied dataset divided with 70% training and 30 % testing. A comparative analysis of three ML classifiers RF, BN, and LB have been used and a considerably very high classification ratio of 96.67 %, 97.22 %, and 97.78 % have been achieved respectively when the AOI size (200×200) have been deployed to the classifiers.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.