“…Typically, a data science project is described as a project that uses statistical and machine‐learning techniques on large volumes of unstructured and/or structured data generated by systems, people, sensors, or digital traces of information from people. This work is done in a distributed computing environment with a goal to identify correlations and causal relationships, classify and predict events, identify patterns and anomalies, and infer probabilities, interest, and sentiment (Das, Cui, Campbell, Agrawal, & Ramnath, ). Big Data is often thought of as a subset of data science, where the amount of data requires the use of special tools and algorithms.…”