2017
DOI: 10.24017/science.2017.3.18
|View full text |Cite
|
Sign up to set email alerts
|

Successful Data Science Projects: Lessons Learned from Kaggle Competition

Abstract: Abstract:The workflow from data understanding to deployment of an analytical model of a data science project begins at framing the problem at hand, a task that is typically business-oriented and requires humanto-human interaction. However, the next three steps: data understanding, feature extraction, and model building that come next in the pipeline are the key to successful data science projects. Failing to fully understand the requirements of each of these three steps can negatively affect the performance of… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2019
2019
2020
2020

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 14 publications
0
1
0
Order By: Relevance
“…If any of the data preprocessing phases is not correctly handled, machine-learning algorithms will not run or can give misleading results [2]. Having in mind the significance of data preprocessing to data science [3], this study discusses data preprocessing and how we applied it to Flight MH370 social data.…”
Section: Introductionmentioning
confidence: 99%
“…If any of the data preprocessing phases is not correctly handled, machine-learning algorithms will not run or can give misleading results [2]. Having in mind the significance of data preprocessing to data science [3], this study discusses data preprocessing and how we applied it to Flight MH370 social data.…”
Section: Introductionmentioning
confidence: 99%