Recent advances in experimental and computational methods are increasing the quantity and complexity of generated data. This massive amount of raw data needs to be stored and interpreted in order to advance the materials science field. Identifying correlations and patterns from large amounts of complex data is being performed by machine learning algorithms for decades. Recently, the materials science community started to invest in these methodologies to extract knowledge and insights from the accumulated data. This review follows a logical sequence starting from density functional theory as the representative instance of electronic structure methods, to the subsequent high-throughput approach, used to generate large amounts of data. Ultimately, data-driven strategies which include data mining, screening, and machine learning techniques, employ the data generated. We show how these approaches to modern computational materials science are being used to uncover complexities and design novel materials with enhanced properties. Finally, we point to the present research problems, challenges, and potential future perspectives of this new exciting field. characterized by its diverse and huge volume, usually ranging from terabytes to petabytes of data, being created in or near real-time. Such data is found either structured and unstructured in nature, and is exhaustive, usually aiming to capture entire populations in a scalable manner [7]. Simple tasks represent challenges in this scale: capture, curation, storage, search, sharing, analysis, and visualization of the data cannot be accomplished without the proper tools. Thus, it can be effectively summarized by the popular 'five V's': volume, velocity, variety, veracity, and value, shown in figure 3(right). A related sixth V is visualization, although not exclusive to Big Data, which requires different techniques to handle data with various characteristics.Striving to tackle the challenges imposed by Big Data, the field of Data Science has arisen. It is largely interdisciplinary being a combination of mathematics and statistics, computer science and programming, and domain knowledge for problem definition and solving, as shown in figure 3(left). Its objective is, roughly speaking, to deal with the whole process of data production, cleaning, preparation, and finally, analysis. Data science encompasses areas such as Big Data, which deals with large volumes of data, and data mining, which relates to analysis processes to discover patterns and extract knowledge from data, part of the so-called Knowledge Discovery in Databases (KDD).The analysis process within Data Science is challenging, as the techniques are very different from traditional static and rigid datasets, generated and analyzed under a predetermined hypothesis. The distinction from traditional data is based on the larger abundance, exhaustivity, and variety of Big Data. It is also much more dynamic, messy and uncertain, being highly relational [7]. Recently, the possibility of overcoming such a challenge slowly sta...