Big data is one of the hottest topics in information science. It has become the key for biological and medical discovery and research, making developing new methods for data management, analysis and accessibility a great challenge in the field. This study proposes an integrated gene analysis approach, in terms of classification and prediction methods for understanding, analyzing and interpretation of biological data related to cancer. The final aim of this study is to predict and classify several subtypes of cancer, based on gene expression.