Cancer leads to most deaths worldwide, and breast cancer isa leading disease that causes death among women. This disease is unique in the way that once treated, can recur in some cases. Individuals are unable to identify their condition before it becomes dangerous. Extracting significant predictive features of breast cancer is an important and risky job for further study.
Researchers have applied data mining techniques in medical science. Several authors suggest that a single method doesn't resolve issues in diagnosing problems, and a hybrid model is desirable. In this paper, the authors propose an integrated approachto avoid Type 1 and 2 errors in predicting recurrence. They identify important and contradicting variables and consider them for inclusion and exclusion, respectively, to revise the dataset. The evaluation of findings of crucial methods, using original and revised datasets, widens the choice of identifying the technique with higher accuracy. The results show that the accuracy improves with the selection of variables restricted to the ones identified as relatively significant, and the dataset revised after elimination of contradicting variables.