This article describes how the enormous size of data in IoT needs efficient data mining model for information extraction, classification and mining hidden patterns from data. CBR is a learning, mining and problem-solving approach which solves a problem by relating past similar solved problems. One issue with CBR is feature weight to measure the similarity among cases to mine similar past cases. NN's pruning is a popular method, which extracts feature weights from a trained neural network without losing much generality of the training set by using four mechanisms: sensitivity, activity, saliency and relevance. However, training NN with imbalanced data leads the classifier to get biased towards the majority class. Therefore, this article proposes a hybrid CBR model with RUS and cost sensitive back propagation neural network in IoT environment to deal with the feature weighting problem in imbalance data. The proposed model is validated with six real-life datasets. The experimental results show that the proposed model is better than other feature weighting methods.
Data Mining is a powerful technology to help organization to concentrate on most important data by extracting useful information from large database. One of the most commonly used techniques in data mining is Artificial Neural Network due to its high performance in many application domains. Despite many advantages of Artificial Neural Network, one of its main drawbacks is its inherent black box nature which is the main problem of using Artificial Neural Network in data mining. Therefore, this paper proposes a rule extraction algorithm from neural network using classified and misclassified data to convert the black box nature of Artificial Neural Network into a white box. The proposed algorithm is a modification of the existing algorithm, Rule Extraction by Reverse Engineering (RxREN). The proposed algorithm extracts rules from trained neural network for datasets with mixed mode attributes using pedagogical approach. The proposed algorithm uses both classified as well as misclassified data to find out the data ranges of significant attributes in respective classes, which is the innovation of the proposed algorithm. The experimental results clearly show that the performance of the proposed algorithm is superior to existing algorithms.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.