Powdery mildew is a dangerous disease that reduces the quality and the yield of tomato fruit rapidly. Its early prediction is a prior requirement for obtaining good quality fruit. Therefore, in this study, the best classifier amongst various classifiers has been discovered using different machine learning algorithms. This classifier can precisely classify whether the meteorological conditions of a particular day are conducive to the development of powdery mildew disease or not. Tomato powdery mildew disease dataset has been tested using various performance measures and the results computed for all the classifiers are promising. Friedman test has been used to rank multiple classifiers and post hoc analysis has also been done using the Nemenyi test. It has been observed in comparison that 62.05% of the total pairs of classifiers perform significantly different from each other, and medium Gaussian support vector machine (MGSVM) is the best classifier with 94.74% accuracy.
Plant diseases can cause significant losses to agricultural productivity; therefore, their early prediction is much needed. So far, many machine learning-based plant disease prediction models have been recommended, but these models face a problem of noisy class label dataset that degrades the performance. Noisy class label dataset results from the improper assignment of positive class labels into negative class data samples or vice versa. Hence, a precise and noise-free plant disease model is required for a better prediction. The current study proposes noise reduction-based hybridized classifiers for plant disease prediction. One tomato and four soybean disease datasets have been selected to conduct the proposed research. The Adaptive Sampling-based Class Label Noise Reduction (AS-CLNR) method has been used along with the Support Vector Machine (SVM) approach for noise reduction. The noise-minimized datasets have been fed into the Extreme Learning Machine (ELM), Decision Tree (DT), and Random Forest (RF) classifiers whose parameters are optimized using Genetic Algorithm (GA) for developing plant disease prediction models. The performances of all these models viz. Hybrid SVM-GA-ELM, Hybrid SVM-GA-DT, and Hybrid SVM-GA-RF have been evaluated using Accuracy, Area under ROC Curve, and F1-Score metrics. Further, these classifiers have been ranked using the statistical Friedman Test in which the Hybrid SVM-GA-RF classifier performed the best. Lastly, the Nemenyi test has also been performed to find out if significant differences exist between various classifiers or not. It was found that 33.33% of the total pairs of hybrid classifiers show a remarkably different performance from one another.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.