2009
DOI: 10.1007/978-3-642-04174-7_27
|View full text |Cite
|
Sign up to set email alerts
|

Bi-directional Joint Inference for Entity Resolution and Segmentation Using Imperatively-Defined Factor Graphs

Abstract: Abstract. There has been growing interest in using joint inference across multiple subtasks as a mechanism for avoiding the cascading accumulation of errors in traditional pipelines. Several recent papers demonstrate joint inference between the segmentation of entity mentions and their de-duplication, however, they have various weaknesses: inference information flows only in one direction, the number of uncertain hypotheses is severely limited, or the subtasks are only loosely coupled. This paper presents a hi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0
2

Year Published

2010
2010
2013
2013

Publication Types

Select...
3
1
1

Relationship

1
4

Authors

Journals

citations
Cited by 19 publications
(17 citation statements)
references
References 12 publications
0
15
0
2
Order By: Relevance
“…Every pair of consecutive samples in the MCMC chain is ranked according to the model and the ground truth, and the parameters are updated when the rankings disagree. This allows the learner to acquire more supervision per instance, and has led to efficient training for models in which inference is expensive and generally intractable [23].…”
Section: Rank-based Learning and Distant Supervisionmentioning
confidence: 99%
“…Every pair of consecutive samples in the MCMC chain is ranked according to the model and the ground truth, and the parameters are updated when the rankings disagree. This allows the learner to acquire more supervision per instance, and has led to efficient training for models in which inference is expensive and generally intractable [23].…”
Section: Rank-based Learning and Distant Supervisionmentioning
confidence: 99%
“…does not contain rules referring to specific strings occurring in the data, which achieves an AURPC of .971 [40]. Note that the MLN based approach in [40] -as well as more recent approaches achieving still higher accuracy [33,39] -perform collective classification, and therefore can exploit the fact that the binary relation on bibliographic records that one predicts is an equivalence relation. The two classification models we have used both perform independent predictions for each pair of bibliographic records, and therefore cannot be expected to achieve results that are competitive with state-of-the-art collective approaches.…”
Section: Coramentioning
confidence: 99%
“…The two classification models we have used both perform independent predictions for each pair of bibliographic records, and therefore cannot be expected to achieve results that are competitive with state-of-the-art collective approaches. It should be emphasized, though, that in [33,39] the MLN structure (i.e. the set of logical formulae) was carefully designed by hand, while in our experiments the TET structure is learned from data.…”
Section: Coramentioning
confidence: 99%
“…The dataset References origins in a domain that is very popular for the evaluation of novel IE techniques (cf. [1,11,12,13]), whereas the dataset Curricula Vitae belongs to classical IE problems of template extraction.…”
Section: Datasetsmentioning
confidence: 99%
“…The work of Peng et al [11] provides a deep analysis of different settings and established linear-chain CRFs as the state-of-the-art for the segmentation of references. Approaches for joint inference [12,13] combine different tasks within a model. Here, the accuracy of the labeling can be increased when entity resolution and segmentation are jointly performed.…”
Section: Related Workmentioning
confidence: 99%