2013
DOI: 10.1007/s10618-013-0306-1
|View full text |Cite
|
Sign up to set email alerts
|

Repeated labeling using multiple noisy labelers

Abstract: This paper addresses the repeated acquisition of labels for data items when the labeling is imperfect.We examine the improvement (or lack thereof) in data quality via repeated labeling, and focus especially on the improvement of training labels for supervised induction. With the outsourcing of small tasks becoming easier, for example via Amazon's Mechanical Turk, it often is possible to obtain less-than-expert labeling at low cost. With low-cost labeling, preparing the unlabeled part of the data can become con… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

1
116
0
1

Year Published

2015
2015
2020
2020

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 144 publications
(118 citation statements)
references
References 40 publications
1
116
0
1
Order By: Relevance
“…Redundancy of information access has been shown, under certain conditions, to increase classification accuracy of web documents [22], an agent's trust in a piece of information [23] [24], the likelihood of winning a game where an increased pool of knowledge increases odds of success [25] and accuracy on classification problems using voters from Mechanical Turk [26].…”
Section: Related Workmentioning
confidence: 99%
See 4 more Smart Citations
“…Redundancy of information access has been shown, under certain conditions, to increase classification accuracy of web documents [22], an agent's trust in a piece of information [23] [24], the likelihood of winning a game where an increased pool of knowledge increases odds of success [25] and accuracy on classification problems using voters from Mechanical Turk [26].…”
Section: Related Workmentioning
confidence: 99%
“…Of particular interest to our study is the work of [26], who consider how best to use a constrained set of human labelers whose abilities are initially unknown to perform a standard machine learning classification problem. First, [26] discuss the existence of a cost in adding additional human labelers, showing that redundancy of labeling can be both very cost efficient and cost inefficient, depending on how it is implemented.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations