Objective: Breast cancer is one of the most common cancers affecting women. Both physicians and patients have concerned about breast cancer survivability. Many researchers have studied the breast cancer survivability applying artificial nerural network model (ANN). Usually ANN model outperformed in classification of breast cancer survivability than other models such as logistic regression, Bayesian network (BN), or decision tree models. However, physicians in the fields hesitate to use ANN model, because ANN is a black-box model, and hard to explain the classification result to patients. In this study, we proposed a hybrid model with a degree of the accuracy and interpretation by combining the ANN for accuracy and BN for interpretation. Methods: We developed an artificial neural network, a Bayesian network, and a hybrid Bayesian network model to predict breast cancer prognosis. The hybrid model combined the artificial neural network and the Bayesian network to obtain a good estimation of prognosis as well as a good explanation of the results. The National Cancer Institute's SEER program public-use data were used to construct and evaluate the proposed models. Nine variables, which are clinically acceptable, were selected for input to the proposed models' nodes. A confidence value of the neural network served as an additional input node to the hybrid Bayesian network model. Ten iterations of random subsampling were performed to evaluate performance of the models. . The known risk factors are age at diagnosis, age of menarche, genetic risk, and family history [3][4][5][6] . The most widely accepted prognostic factor for breast cancer is the American Joint Commission on Cancer (AJCC) staging system based on the TNM system (T, tumor; N, node; M, metastasis) [7][8][9][10][11] . The Nottingham prognosis index is a grading system that incorporates the evaluation of tumor size, stage of disease, and tumor grade 12)13). Recently, molecular biologic markers, such as the estrogen receptor, progesterone receptor, and HER-2/neu, have become important prognostic factors [14][15][16] .Several studies have reported on breast cancer prognosis prediction using data mining techniques. Burke et al. . They used 6,787 cases of the National Cancer Institute's Surveillance, Epidemiology, and End Results (SEER) breast carcinoma data set (1977)(1978)(1979)(1980)(1981)(1982) . Lee compared various predictive modeling techniques to predict the breast cancer occurance (not prognosis) by using Korean data obtained from 209 subject (109 patients and 100 controls).The AUC of the proposed Naïve Bayes model was 0.90, and it outperformed than other various Bayesian network models or regression models 19) .Lundin et al. compared logistic regression and ANN models for survival estimation in 951 breast cancer cases 20) . Eight variables were entered as input to the network, including tumor size, axillary node, histologic type, mitotic count, nuclear pleomorphism, tubule formation, tumor necrosis, and age. The area under the curve (AUC) of th...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.