Luis Galárraga scite author profile

Recent advances in information extraction have led to huge knowledge bases (KBs), which capture knowledge in a machine-readable format. Inductive Logic Programming (ILP) can be used to mine logical rules from these KBs, such as "If two persons are married, then they (usually) live in the same city". While ILP is a mature field, mining logical rules from KBs is difficult, because KBs make an open world assumption. This means that absent information cannot be taken as counterexamples. Our approach AMIE [16] has shown how rules can be mined effectively from KBs even in the absence of counterexamples. In this paper, we show how this approach can be optimized to mine even larger KBs with more than 12M statements. Extensive experiments show how our new approach, AMIE+, extends to areas of mining that were previously beyond reach.
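As a rough illustration of why the open world assumption matters for the example rule above, the sketch below scores "married implies same city" on a toy KB in two ways. It is not the AMIE+ implementation: the facts, the relation names `marriedTo` and `livesIn`, and the function `rule_stats` are all invented for this example. The second measure only counts a prediction as a counterexample when the KB already knows some city for that person, which is one common way to avoid penalizing absent information.

```python
from collections import defaultdict

# Toy KB of (subject, relation, object) triples. All facts are hypothetical.
KB = {
    ("anna", "marriedTo", "ben"),
    ("carl", "marriedTo", "dora"),
    ("eve", "marriedTo", "frank"),
    ("anna", "livesIn", "paris"),
    ("ben", "livesIn", "paris"),
    ("carl", "livesIn", "rome"),
    ("dora", "livesIn", "oslo"),
    ("eve", "livesIn", "lima"),
    # No livesIn fact for frank: the information is absent, not false.
}


def rule_stats(kb):
    """Score the rule marriedTo(x, y) AND livesIn(x, c) => livesIn(y, c)."""
    lives = defaultdict(set)
    for s, r, o in kb:
        if r == "livesIn":
            lives[s].add(o)

    # Every (person, city) pair the rule predicts.
    predictions = {
        (y, city)
        for x, r, y in kb if r == "marriedTo"
        for city in lives.get(x, ())
    }

    # Support: predictions that are already facts in the KB.
    support = sum(1 for y, city in predictions if city in lives.get(y, ()))

    # Closed-world confidence: every missing prediction is a counterexample.
    std_conf = support / len(predictions) if predictions else 0.0

    # Open-world-friendly confidence: ignore predictions about people
    # for whom the KB knows no city at all.
    judged = [p for p in predictions if lives.get(p[0])]
    open_conf = support / len(judged) if judged else 0.0

    return support, std_conf, open_conf


support, std_conf, open_conf = rule_stats(KB)
print(support, std_conf, open_conf)
```

On this toy data the prediction about `frank` lowers the closed-world score but is simply left out of the second one, because nothing is known about where `frank` lives.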

Canonicalizing Open Knowledge Bases

Galárraga, Heitz, Murphy, et al. 2014

Open information extraction approaches have led to the creation of large knowledge bases from the Web. The problem with such methods is that their entities and relations are not canonicalized, leading to redundant and ambiguous facts. For example, they may store ⟨Barack Obama, was born in, Honolulu⟩ and ⟨Obama, place of birth, Honolulu⟩. In this paper, we present an approach based on machine learning methods that can canonicalize such Open IE triples, by clustering synonymous names and phrases. We also provide a detailed discussion about the different signals, features and design choices that influence the quality of synonym resolution for noun phrases in Open IE KBs, thus shedding light on the middle ground between "open" and "closed" information extraction systems.
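To make the idea of clustering synonymous names concrete, here is a minimal sketch that groups noun phrases by token overlap. It is a simple stand-in heuristic, not the machine learning approach of the paper: the Jaccard similarity signal, the 0.5 threshold, and the names `jaccard` and `cluster_phrases` are assumptions chosen for this example.

```python
def jaccard(a, b):
    """Token-level Jaccard similarity between two phrases (case-insensitive)."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    union = ta | tb
    return len(ta & tb) / len(union) if union else 0.0


def cluster_phrases(phrases, threshold=0.5):
    """Group phrases whose similarity reaches the threshold (single linkage)."""
    parent = list(range(len(phrases)))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(phrases)):
        for j in range(i + 1, len(phrases)):
            if jaccard(phrases[i], phrases[j]) >= threshold:
                parent[find(j)] = find(i)

    clusters = {}
    for i, phrase in enumerate(phrases):
        clusters.setdefault(find(i), []).append(phrase)
    return sorted(clusters.values())


print(cluster_phrases(["Barack Obama", "Obama", "Honolulu"]))
# → [['Barack Obama', 'Obama'], ['Honolulu']]
```

A purely lexical signal like this would wrongly merge different people who share a surname, which is the kind of design choice the abstract says the paper examines.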

Predicting Completeness in Knowledge Bases

Galárraga, Razniewski, Amarilli, et al. 2017

Knowledge bases such as Wikidata, DBpedia, or YAGO contain millions of entities and facts. In some knowledge bases, the correctness of these facts has been evaluated. However, much less is known about their completeness, i.e., the proportion of real facts that the knowledge bases cover. In this work, we investigate different signals to identify the areas where a knowledge base is complete. We show that we can combine these signals in a rule mining approach, which allows us to predict where facts may be missing. We also show that completeness predictions can help other applications such as fact prediction.

Fast and Exact Rule Mining with AMIE 3

Lajus, Galárraga, Suchanek 2020

VICKEY: Mining Conditional Keys on Knowledge Bases

Symeonidou, Galárraga, Pernelle, et al. 2017

A conditional key is a key constraint that is valid in only a part of the data. In this paper, we show how such keys can be mined automatically on large knowledge bases (KBs). For this, we combine techniques from key mining with techniques from rule mining. We show that our method can scale to KBs of millions of facts. We also show that the conditional keys we mine can improve the quality of entity linking by up to 47 percentage points.
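The definition of a conditional key can be illustrated with a small check over toy records. This is only a sketch of the definition, not the VICKEY mining algorithm: the records, the attribute names, and the helper `is_key` are hypothetical, and a missing attribute is simply treated as the value `None`.

```python
def is_key(records, attrs, condition=None):
    """Return True if no two records matching `condition` agree on all `attrs`.

    `condition` is an optional dict of attribute -> required value; records
    that do not satisfy it are ignored, which makes the key "conditional".
    """
    seen = set()
    for rec in records:
        if condition and any(rec.get(k) != v for k, v in condition.items()):
            continue
        value = tuple(rec.get(a) for a in attrs)
        if value in seen:
            return False
        seen.add(value)
    return True


# Hypothetical records for illustration only.
people = [
    {"id": 1, "lastName": "Dupont", "city": "Paris"},
    {"id": 2, "lastName": "Dupont", "city": "Lyon"},
    {"id": 3, "lastName": "Martin", "city": "Lyon"},
]

print(is_key(people, ["lastName"]))                    # → False
print(is_key(people, ["lastName"], {"city": "Lyon"}))  # → True
```

Here `lastName` does not identify a person in the whole dataset, but it does within the part of the data where `city` is `Lyon`.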

Mining rules to align knowledge bases

Galárraga, Preda, Suchanek 2013

The Semantic Web has made huge progress in the last decade, and now comprises hundreds of knowledge bases (KBs). The Linked Open Data cloud connects the KBs in this Web of data. However, the links between the KBs are mostly concerned with the instances, not with the schema. Aligning the schemas is not easy, because the KBs may differ not just in their names for relations and classes, but also in their inherent structure. Therefore, we argue in this paper that advanced schema alignment is needed to tie the Semantic Web together. We put forward a particularly simple approach to illustrate how that might look.

AMIE

et al. 2013

HiPaR: Hierarchical Pattern-Aided Regression

Galárraga, Pelgrin, Termier 2021

Fast rule mining in ontological knowledge bases with AMIE+
