2018
DOI: 10.5070/t5111035892

The fivethirtyeight R Package: "Tame Data" Principles for Introductory Statistics and Data Science Courses

Cited by 10 publications (9 citation statements)
References 8 publications (9 reference statements)
“…The need to bring real secondary data into the classroom generates a need to prepare that data so that it is suitable for addressing learning objectives. Kim et al (2018) discuss the continuum from "raw" to "clean," and note that, for some objectives (such as learning to clean data), perfectly clean is too much and raw is not enough. The happy medium is "tame" data (their terminology) that is just clean enough to fill its intended educational role.…”
Section: Discussion; citation type: mentioning
confidence: 99%
“…Note that the R programming environment has the appropriate package "The fivethirtyeight R", which facilitates the use of Twitter resources FiveThirtyEight in data courses [4]. We added some tweeters to the recommendation list, including Ukrainian resources as well as resources related to population censuses.…”
Section: Conversational Style; citation type: mentioning
confidence: 99%
“…The data is first organized into "tidy data" (Wickham 2014) and then further wrangled using step-wise piping with a split-apply-combine strategy for mutating new variables (Wickham 2011). Tidy data shouldn't be confused with "tame data," which Kim, Ismay, and Chunn (2018) coined to refer to textbook datasets suitable for teaching, particularly teaching statistics. The resulting (tame) data is provided in a new R package called yowie, which includes the code so the process is reproducible and could be used to further refresh the data as new records are made available in the NLSY79 database.…”
Section: Introduction; citation type: mentioning
confidence: 99%