Repeated labeling using multiple noisy labelers

Ipeirotis, Panagiotis G.; Provost, Foster; Sheng, Victor S.; Wang, Jing

doi:10.1007/s10618-013-0306-1

Cited by 144 publications

(118 citation statements)

References 40 publications

Supporting

Mentioning

116

Contrasting

Unclassified

Order By: Relevance

“…Redundancy of information access has been shown, under certain conditions, to increase classification accuracy of web documents [22], an agent's trust in a piece of information [23] [24], the likelihood of winning a game where an increased pool of knowledge increases odds of success [25] and accuracy on classification problems using voters from Mechanical Turk [26].…”

Section: Related Workmentioning

confidence: 99%

“…Of particular interest to our study is the work of [26], who consider how best to use a constrained set of human labelers whose abilities are initially unknown to perform a standard machine learning classification problem. First, [26] discuss the existence of a cost in adding additional human labelers, showing that redundancy of labeling can be both very cost efficient and cost inefficient, depending on how it is implemented.…”

Section: Related Workmentioning

confidence: 99%

“…First, [26] discuss the existence of a cost in adding additional human labelers, showing that redundancy of labeling can be both very cost efficient and cost inefficient, depending on how it is implemented. The cost in their study is monetary, but correlates to the cost of reducing incorrect information at the expense of quickly spreading correct information.…”

Section: Related Workmentioning

confidence: 99%

“…We see similar trade-off issues in our results. Secondly, [26] note that as the number of labelers giving an answer for a specific classification problem increases, conflict is naturally more likely to occur. In a similar vein, [27] shows that high levels of task redundancy (the number of people doing the same thing) in an organization may decrease performance, in part because it leads to conflicting results.…”

Section: Related Workmentioning

confidence: 99%

“…Based on this, we postulated that in a network with highly redundant access to information, knowledge will spread quickly in a stable environment, but much slower in an environment that produces a significant amount of conflicting knowledge. Finally, we make use of work in [26] on the concept of a probability-distribution based approach to how certain their classifier should be about a particular problem. We utilize this method for computing uncertainty when we represent how "uncertain" an agent is about the knowledge he holds.…”

Section: Related Workmentioning

confidence: 99%

See 4 more Smart Citations

Simulating Diffusion with Conflicting Knowledge

Joseph

Carley

2012

SSRN Journal

View full text Add to dashboard Cite

assistance on the ideas presented in this work.Keywords: Agent-based modeling, Diffusion Processes, Decision-making. AbstractA simulation model is developed and used to better understand the diffusion of conflicting knowledge in a social network. Within the simulation, correct and incorrect knowledge diffuse simultaneously, however, simulation agents are not explicitly made aware that contradictory information exists. Instead, they may become aware of this by obtaining both correct and incorrect knowledge of the same piece of information. The agent must then determine what knowledge to accept. We implement various models of how people react to conflicting knowledge, which we refer to as intrapersonal conflict resolution strategies (ICRS). We analyze these ICRS with respect to the speed and breadth of correct and incorrect knowledge spread and how often agents find themselves holding conflicting knowledge. We also consider the effect of varying the odds of incorrect knowledge existing in the system, the network structure and the differentials in trust between agents. We find evidence that our model is consistent with many real-world notions of diffusion. Consequently, we make tentative conclusions about the effect of our various experimental conditions on the spread of conflicting knowledge in a social network. We note that conflicting knowledge spreads more slowly and persists longer than knowledge which cannot be conflicted.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

See 3 more Smart Citations

Simulating Diffusion with Conflicting Knowledge

Joseph

Carley

2012

SSRN Journal

View full text Add to dashboard Cite

show abstract

Average Jane, Where Art Thou? – Recent Avenues in Efficient Machine Learning Under Subjectivity Uncertainty

Rizos

Schuller

2020

Information Processing and Management of Uncertainty in Knowledge-Based Systems

View full text Add to dashboard Cite

RuBQ: A Russian Dataset for Question Answering over Wikidata

Korablinov

Braslavski

2020

Lecture Notes in Computer Science

View full text Add to dashboard Cite

The paper presents RuBQ, the first Russian knowledge base question answering (KBQA) dataset. The high-quality dataset consists of 1,500 Russian questions of varying complexity, their English machine translations, SPARQL queries to Wikidata, reference answers, as well as a Wikidata sample of triples containing entities with Russian labels. The dataset creation started with a large collection of question-answer pairs from online quizzes. The data underwent automatic filtering, crowdassisted entity linking, automatic generation of SPARQL queries, and their subsequent in-house verification. The freely available dataset will be of interest for a wide community of researchers and practitioners in the areas of Semantic Web, NLP, and IR, especially for those working on multilingual question answering. The proposed dataset generation pipeline proved to be efficient and can be employed in other data annotation projects.

show abstract

Repeated labeling using multiple noisy labelers

Cited by 144 publications

References 40 publications

Simulating Diffusion with Conflicting Knowledge

Simulating Diffusion with Conflicting Knowledge

Average Jane, Where Art Thou? – Recent Avenues in Efficient Machine Learning Under Subjectivity Uncertainty

RuBQ: A Russian Dataset for Question Answering over Wikidata

Contact Info

Product

Resources

About