Introduction With the penetration and suffusion of information and communication technology (ICT) in our lives, scientific research has evolved as well. As such, scientific research is more data intensive and derives information from massive volumes of digitized data. As of 2013, 2.5 quintillion bytes of data are being produced every day (https://www-01.ibm.com/software/data/bigdata/what-is-big-data.html), 90% of which was produced in the last two years (SINTEF, 2013). A correct assumption is that the amount of data being produced will continue to increase. For instance, Internet users numbered 2.8 billion in 2013, whereas today, they number more than 3.5 billion (http://www.internetlivestats.com/internet-users/). The use of social media has increased the amount of data being produced. The total amount of data in the world is expected to be 4.1 zetabytes in 2016 and is estimated to be 40 zetabytes in 2020. Therefore, data management has become an important issue. Likewise, in the scientific arena, data has become so prominent that it has been given a new name in "The Fourth Paradigm: Data-Intensive Scientific Discovery" in which "all of the science literature is online, all of the science data is online, and they interoperate with each other" (Hey et al., 2009). In previous paradigms scientific activities were driven by experimentation, theory, and computation (Hey et al., 2009). The traditional hypothesis-based scientific approach has been gradually replaced by the analyses of electronic databases that can hold large amounts of information. As papers, lab books, tapes, and photographic films have moved to digital archives, cloud storages, and data warehouses, science has gone beyond the boundaries of hypotheses. Analyses are built on the collections themselves, and patterns, anomalies, and diversities on which questions will be posed later are sought. Hence, the term "data-intensive science" has emerged, and this practice derives information from the datasets collected by various computerized modeling and simulation systems, imaging devices, sensors and sensor networks, and other data gathering and storage techniques (Hey et al., 2009; Knyazkov et al., 2012). The vision is to have "all of the science literature online, all of the science data online, and interoperate with each other" (Hey et al., 2009). These mega-scale databases consist of data captured by various novel scientific tools, sometimes on a realtime basis. With this continuous flow of electronic information, the need to collect, store, curate, integrate, and analyze data in a way that could help inter-institutional and interdisciplinary collaboration has gained importance for the advancement of science in the twenty-first century. According to Birnholtz and Bietz's study (2003, p. 339), data is an evidence for validation of scientific contribution and it makes a social contribution to the establishment of practice. Therefore, understanding the importance of the data is vital to design, sustain and curate well-structured research data management syst...