The rapid increase in the growth of text information over the past two decades has led to the need for the use of text classification techniques, particularly in the area of information retrieval, data mining and data management. The precise results and simplicity of the K-Nearest Neighbor Classification Algorithm (K-NN) in knowledge mining is the reason that made it one of the most important classification algorithms used in many tasks such as pattern recognition, regression, and text classification. Through experiments and analysis of the results of the use of the traditional algorithm of the (K-NN), there are some deficiencies in their performance, especially when the data are large such as the algorithm was unable to process big data by rapid extraction with minimal storage space and generate useless samples computation and probability problems. In this paper, we have developed an enhanced algorithm and get the best results and perform better than that in the traditional algorithm. The significant improvement in our model performance is due to the improvement by removing unnecessary computational samples in the traditional algorithm. The performance is further improved by using the lost value computational method to define results as a prelude to avoid wasting time by correcting and filtering noise, examining the database, and eliminating unwanted records. Additionally, the inverse logarithmic function was used to solve the probability problems the algorithm encounters. The experimental results showed the efficiency of the modified algorithm in reducing the sample size and speeding up the search for the required data.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.