Breast cancer is the most predominantly occurring cancer in the world. Several genes and
proteins have been recently studied to predict biomarkers that enable disease identification early and
monitor its recurrence. In the era of high-throughput technology, studies show several applications
of big data for identifying potential biomarkers. The review aims to provide a comprehensive overview of big data analysis in breast cancer towards the prediction of biomarkers with emphasis on
computational methods like text mining, network analysis, next-generation technology (NGS), machine learning (ML), deep learning (DL), and precision medicine. Integrating data from various
computational approaches enables the stratification of cancer patients and the identification of molecular signatures in cancer and their subtypes. The computational methods and statistical analysis
help expedite cancer prognosis and develop precision cancer medicine (PCM). As a case study in
the present work, we constructed a large gene-drug interaction network to predict new biomarkers
genes. The gene-drug network helped us to identify eight genes that could be novel potential biomarkers.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.