2016
DOI: 10.1037/met0000111
|View full text |Cite
|
Sign up to set email alerts
|

A practical guide to big data research in psychology.

Abstract: The massive volume of data that now covers a wide variety of human behaviors offers researchers in psychology an unprecedented opportunity to conduct innovative theory- and data-driven field research. This article is a practical guide to conducting big data research, covering data management, acquisition, processing, and analytics (including key supervised and unsupervised learning data mining methods). It is accompanied by walkthrough tutorials on data acquisition, text analysis with latent Dirichlet allocati… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
145
0
2

Year Published

2017
2017
2020
2020

Publication Types

Select...
7
3

Relationship

0
10

Authors

Journals

citations
Cited by 127 publications
(159 citation statements)
references
References 61 publications
(103 reference statements)
0
145
0
2
Order By: Relevance
“…Classification involves identifying the category where an observation belongs, given known category labels 20. Logistic regression is an example of a classifier from statistics.…”
Section: Big Data Analysis Techniquesmentioning
confidence: 99%
“…Classification involves identifying the category where an observation belongs, given known category labels 20. Logistic regression is an example of a classifier from statistics.…”
Section: Big Data Analysis Techniquesmentioning
confidence: 99%
“…Given that concealing negative emotions may be a particular concern among men (Nadeau et al, 2016), and given the relatively low participation of men in most psychology convenience samples, the possibility of oversampling male users may be a benefit rather than a limitation of Reddit analyses. Furthermore, the site's use of upvotes and downvotes (or "karma") tends to discourage most everyday users-that is, people not using dedicated "trolling" accounts-from behaving more antisocially than they would in real life (Barthel et al, 2016;Chen and Wojcik, 2016).…”
Section: Introductionmentioning
confidence: 99%
“…Within the field of IB in particular, researchers routinely deal with ''Big Data,'' that is, large amounts of information stored in archival datasets (Harlow & Oswald, 2016). Because these datasets were not collected directly in response to a particular research question, they contain many variables that can be restructured to produce ''favorable'' results (i.e., better fit estimates, larger effect-size estimates) (Chen & Wojcik, 2016). For example, consider the case of firm performance, which is one of the most frequently measured constructs in IB.…”
Section: Selection Of Variables To Include In a Modelmentioning
confidence: 99%