2017
DOI: 10.1002/asi.23873
|View full text |Cite
|
Sign up to set email alerts
|

Predicting data science sociotechnical execution challenges by categorizing data science projects

Abstract: The challenge in executing a data science project is more than just identifying the best algorithm and tool set to use. Additional sociotechnical challenges include items such as how to define the project goals and how to ensure the project is effectively managed. This paper reports on a set of case studies where researchers were embedded within data science teams and where the researcher observations and analysis was focused on the attributes that can help describe data science projects and the challenges fac… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
19
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 35 publications
(21 citation statements)
references
References 18 publications
0
19
0
Order By: Relevance
“…In a further study, Saltz et al [17] could label data science projects with two dimensions (infrastructure and discovery), based on which they could identify four different types of data science projects depending on where projects could be placed on the axes. The project types were: Hard to Justify, Exploratory, Well-Defined and Small data.…”
Section: Background and Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In a further study, Saltz et al [17] could label data science projects with two dimensions (infrastructure and discovery), based on which they could identify four different types of data science projects depending on where projects could be placed on the axes. The project types were: Hard to Justify, Exploratory, Well-Defined and Small data.…”
Section: Background and Related Workmentioning
confidence: 99%
“…There are only few (e.g. [10,17]) data science interview studies over multiple companies. Kandel et al [10] concentrate on individual analyst skill set and workflow mentioning within team collaboration briefly.…”
Section: Background and Related Workmentioning
confidence: 99%
“…However, the 4 Vs are sometimes not sufficient to describe a project. Hence, we focus on Saltz et al's [37] characterization of data science projects, in which there are two key dimensions. One key attribute is the level of discovery and the other is the level of technical infrastructure required for the project (this second attribute includes, from a project management perspective, the four Vs).…”
Section: Technical Factorsmentioning
confidence: 99%
“…It described a methodology based on IT value theory and workgroup ideation, guiding big data idea generation, idea assessment and implementation management. Another study [28] Other management challenges discussed [29] are that managers who track business intelligence and analytics costs believe they have already established a higher degree of big data implementations. Another paper [30] discussed big data-driven innovations as key in improving healthcare system sustainability through public-private partnerships (PPPs).…”
Section: Data Storage and Transport (Dst 27mentioning
confidence: 99%
“…Several studies [5,18] discussed the big data validation challenges. One main challenge is determining if the data to be analyzed have no issues, such as mismatching or missing values, needing cleansing, or determining if the data meets quality needs [28]. We noted that the acceptable level of data quality depends on the purpose of that analysis.…”
Section: Data Storage and Transport (Dst 27mentioning
confidence: 99%