Proceedings of the 2013 Workshop on Automated Knowledge Base Construction 2013
DOI: 10.1145/2509558.2509571
|View full text |Cite
|
Sign up to set email alerts
|

A survey of noise reduction methods for distant supervision

Abstract: We survey recent approaches to noise reduction in distant supervision learning for relation extraction. We group them according to the principles they are based on: at-least-one constraints, topic-based models, or pattern correlations. Besides describing them, we illustrate the fundamental differences and attempt to give an outlook to potentially fruitful further research. In addition, we identify related work in sentiment analysis which could profit from approaches to noise reduction.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
43
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 48 publications
(47 citation statements)
references
References 14 publications
0
43
0
Order By: Relevance
“…In 2014, the top performing system, submitted by Stanford, achieved a recall of 0.277, a precision of 0.546 and a F1 score of 0.368 [5,7]. These results are somewhat similar to those submitted the previous year by the best performing system, Relation Factory, which achieved a recall of 0.332, precision of 0.425 and F1 score of 0.373 [4,8].…”
Section: Introductionmentioning
confidence: 61%
See 3 more Smart Citations
“…In 2014, the top performing system, submitted by Stanford, achieved a recall of 0.277, a precision of 0.546 and a F1 score of 0.368 [5,7]. These results are somewhat similar to those submitted the previous year by the best performing system, Relation Factory, which achieved a recall of 0.332, precision of 0.425 and F1 score of 0.373 [4,8].…”
Section: Introductionmentioning
confidence: 61%
“…For most systems, the slot filling task is done in two main steps: candidate generation and candidate validation [8]. The candidate generation stage can be further decomposed into the following steps: (i) query expansion, (ii) document retrieval and (iii) candidate matching.…”
Section: Example Of a Slot Filling System's Structurementioning
confidence: 99%
See 2 more Smart Citations
“…One of most crucial problems in distant supervision is the inherent errors in the automatically generated training data (Roth et al, 2013). Table 1 illustrates this problem with a toy example.…”
Section: Introductionmentioning
confidence: 99%