2020
DOI: 10.1371/journal.pone.0233154
|View full text |Cite
|
Sign up to set email alerts
|

Enhancing big data in the social sciences with crowdsourcing: Data augmentation practices, techniques, and opportunities

Abstract: Proponents of big data claim it will fuel a social research revolution, but skeptics challenge its reliability and decontextualization. The largest subset of big data is not designed for social research. Data augmentation-systematic assessment of measurement against known quantities and expansion of extant data with new information-is an important tool to maximize such data's validity and research value. Using trained research assistants or specialized algorithms are common approaches to augmentation but may n… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
13
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 22 publications
(15 citation statements)
references
References 60 publications
(74 reference statements)
0
13
0
Order By: Relevance
“…5.1. Collective Impact Regional Data Hubs, like 2-1-1 San Diego's CIE, Offer New Sources for High-Quality Quantitative Data on the Homeless Population, That Could Be Used to Replicate Study Findings and Expand Research of the Homeless Population in Other Geographies Future studies could explore the use of collective impact hub data [18][19][20] for advanced statistical quantitative research on the experiences of homelessness in other geographies.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…5.1. Collective Impact Regional Data Hubs, like 2-1-1 San Diego's CIE, Offer New Sources for High-Quality Quantitative Data on the Homeless Population, That Could Be Used to Replicate Study Findings and Expand Research of the Homeless Population in Other Geographies Future studies could explore the use of collective impact hub data [18][19][20] for advanced statistical quantitative research on the experiences of homelessness in other geographies.…”
Section: Discussionmentioning
confidence: 99%
“…Relatively few studies have used machine learning techniques like cluster analysis and decision trees to examine homelessness given the historic lack of quantitative data on those categorized as homeless [16]. The advent of the Collective Impact movement has furthered opportunities for quantitative research in this area through the development of more sophisticated data collection, storage, and management systems in social agencies [18][19][20]. Applying more advanced analytics strategies to data gathered from these shared data systems will provide opportunities to validate and expand on the insightful early research seeking to understand and alleviate homelessness in America.…”
Section: Recent Advances In Data Science Approaches To Studying Socia...mentioning
confidence: 99%
“…We chose Amazon Mechanical Turk (AMT) as our web-based platform as it would facilitate rapid, large-scale participant recruitment [ 8 ]. Importantly, AMT has become an increasingly accepted means of collecting responses from diverse participants [ 9 ].…”
Section: Methodsmentioning
confidence: 99%
“…8 Originally developed to outsource tasks such as market research and media transcription, MTurk has evolved into a recruitment channel for marketing and behavioral science research as evidenced by a study showing that a third of all MTurk tasks originated from academic research groups. 9,10 We surveyed individuals with recent experiences obtaining COVID-19 testing to characterize their behavior related to the notification of testing results to close contacts, and receptiveness of digital contact tracing efforts. The survey inquired about the following: demographic data, details of COVID-19 testing, experiences regarding the notification of test results to close contacts, experiences with public health contact tracing efforts, and their opinions of digital contact tracing services.…”
Section: Survey Designmentioning
confidence: 99%