Breast cancer is a dangerous disease that mainly affects women and is among the deadliest diseases affecting women's lives. Early diagnosis therefore depends on predicting and classifying the disease accurately. Numerous data mining techniques exist for early prediction and classification of breast cancer. A big-data-based analytical model offers a better solution for storing, manipulating, and analyzing large numbers of mammographic images. In this article, a new improved fractional rough fuzzy K-means clustering strategy is used for disease prediction. A new Tunicate Swarm Algorithm (TSA), a bio-inspired metaheuristic optimization approach, is then introduced to optimize the weight parameters. Finally, a labeled ensemble classifier (LEC) is used to classify breast cancer cases as malignant or benign. The data are randomly sampled from the Breast Cancer Wisconsin (Diagnostic) dataset available in the UCI Machine Learning Repository. The proposed strategy is compared with existing strategies such as the Logistic Regression and Random Forest classifiers. The analysis shows that the proposed big-data-based analytical model using LEC achieves 99.3% accuracy, which is higher than the accuracy of the existing approaches.
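To give a feel for the fuzzy clustering step, the following is a minimal sketch of standard fuzzy c-means, not the improved fractional rough fuzzy K-means strategy proposed in the article, whose details are not given here. The function name `fuzzy_c_means`, its parameters, and the toy two-feature points are all illustrative assumptions. No Wisconsin data and no TSA weight optimization are involved.

```python
import math

def fuzzy_c_means(points, init_centers, m=2.0, n_iter=50):
    """Standard fuzzy c-means (an assumed stand-in, not the article's variant).

    points: list of equal-length feature tuples.
    init_centers: starting cluster centers, one per cluster.
    m: fuzzifier (> 1); larger values give softer memberships.
    """
    centers = [list(c) for c in init_centers]
    n_features = len(points[0])
    memberships = []
    for _ in range(n_iter):
        # Membership update: each point gets a degree in [0, 1] per cluster,
        # higher for closer centers; each row sums to 1.
        memberships = []
        for p in points:
            dists = [max(math.dist(p, c), 1e-12) for c in centers]
            row = [
                1.0 / sum((d_j / d_k) ** (2.0 / (m - 1.0)) for d_k in dists)
                for d_j in dists
            ]
            memberships.append(row)
        # Center update: mean of all points weighted by membership**m.
        for j in range(len(centers)):
            weights = [row[j] ** m for row in memberships]
            total = sum(weights)
            centers[j] = [
                sum(w * p[k] for w, p in zip(weights, points)) / total
                for k in range(n_features)
            ]
    return centers, memberships

# Made-up toy points forming two well-separated groups (hypothetical data).
toy_points = [(1.0, 1.1), (0.9, 1.0), (1.2, 0.8),
              (5.0, 5.2), (5.1, 4.9), (4.8, 5.0)]
centers, memberships = fuzzy_c_means(
    toy_points, init_centers=[toy_points[0], toy_points[3]]
)
```

Each row of `memberships` holds one point's degree of belonging to every cluster, so a point near a cluster boundary is shared between clusters instead of being forced into one. This soft assignment is what separates fuzzy clustering from hard K-means.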