2012
DOI: 10.1007/978-3-642-30284-8_15

Unsupervised Learning of Link Discovery Configuration

Abstract: Discovering links between overlapping datasets on the Web is generally realised through the use of fuzzy similarity measures. Configuring such measures is often a non-trivial task that depends on the domain, ontological schemas, and formatting conventions in data. Existing solutions either rely on the user's knowledge of the data and the domain or on the use of machine learning to discover these parameters based on training data. In this paper, we present a novel approach to tackle the issue of data linking wh…
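The abstract is cut off above. As a rough, hypothetical illustration of what a link discovery configuration typically comprises (which attribute pairs to compare, which fuzzy similarity measures to use, their weights, and an acceptance threshold), here is a minimal Python sketch; none of the names or values are taken from the paper.

```python
# Minimal sketch of a link discovery configuration: which attribute pairs to
# compare, with which fuzzy similarity measure, and at which aggregated score
# two records are considered to refer to the same entity.
# All names and values are illustrative; they are not taken from the paper.
from difflib import SequenceMatcher


def string_similarity(a: str, b: str) -> float:
    """A simple fuzzy string similarity in [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


# A configuration: (source attribute, target attribute, similarity function, weight)
CONFIG = {
    "comparisons": [
        ("name", "label", string_similarity, 0.7),
        ("city", "locality", string_similarity, 0.3),
    ],
    "threshold": 0.8,  # aggregated score above which a link is produced
}


def score(source: dict, target: dict, config: dict) -> float:
    """Weighted average of the configured attribute similarities."""
    total, weights = 0.0, 0.0
    for s_attr, t_attr, sim, w in config["comparisons"]:
        total += w * sim(source.get(s_attr, ""), target.get(t_attr, ""))
        weights += w
    return total / weights if weights else 0.0


def is_link(source: dict, target: dict, config: dict) -> bool:
    return score(source, target, config) >= config["threshold"]


if __name__ == "__main__":
    s = {"name": "Jaguar XK", "city": "Coventry"}
    t = {"label": "Jaguar X-K", "locality": "Coventry"}
    print(is_link(s, t, CONFIG))
```

Judging from the title and abstract, the paper's contribution is to determine parameters of this kind (weights, thresholds, choice of measures) without manually labelled training data.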

Cited by 76 publications (69 citation statements)
References 14 publications
“…Thus, learning link specifications can be combined with our approach. For example, specification templates can be used to seed genetic programming algorithms [10] so as to accelerate their convergence. In addition, knowing which template to use can help when choosing the right deterministic model (Boolean classifier, linear classifier) as well as its initialization for these models [8].…”
Section: Discussion (mentioning)
confidence: 99%
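The seeding idea mentioned in this excerpt can be pictured roughly as follows: part of a genetic-programming-style initial population consists of (slightly perturbed) copies of a specification template, and the rest is random. The sketch below is purely illustrative and is not the algorithm of the cited systems; all names are hypothetical.

```python
import random

# Hypothetical sketch of seeding an initial population with a specification
# template: a fraction of individuals are mutated copies of the template,
# the remainder are random specifications.

ATTRIBUTE_PAIRS = [("name", "label"), ("city", "locality"), ("year", "date")]


def random_individual():
    """A random link specification: per-pair weights plus a threshold."""
    return {
        "weights": {pair: random.random() for pair in ATTRIBUTE_PAIRS},
        "threshold": random.uniform(0.5, 0.95),
    }


def mutate(spec, sigma=0.05):
    """Small Gaussian perturbation of a specification, clamped to [0, 1]."""
    return {
        "weights": {p: min(1.0, max(0.0, w + random.gauss(0, sigma)))
                    for p, w in spec["weights"].items()},
        "threshold": min(1.0, max(0.0, spec["threshold"] + random.gauss(0, sigma))),
    }


def seeded_population(template, size=50, seed_fraction=0.2):
    """Initial population: some mutated copies of the template, rest random."""
    n_seeded = int(size * seed_fraction)
    return ([mutate(template) for _ in range(n_seeded)] +
            [random_individual() for _ in range(size - n_seeded)])


if __name__ == "__main__":
    template = {"weights": {p: 1.0 for p in ATTRIBUTE_PAIRS}, "threshold": 0.8}
    population = seeded_population(template)
    print(len(population), population[0])
```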
“…In the case of link discovery, X = S × T while Y = {+1, −1}, with f(x_i) = +1 if ρ(s, t) holds and f(x_i) = −1 in all other cases. Finding the function f for link discovery tasks is generally very costly, as it requires either (mostly manually) labeled training data [9] or a significant amount of computation [10]. The idea behind transfer learning (also coined knowledge transfer) [11] can be broadly described as follows: Given other machine learning tasks t′ with known or unknown classification functions f′ that are somehow "related" to f, use the functions f′ or the domain knowledge available for determining f (i.e., transfer the knowledge from the tasks t′) to improve the process of finding (f, Y).…”
Section: B. Transfer Learning (mentioning)
confidence: 99%
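Restated in display form (the inline notation above loses some formatting in extraction), the classification view of link discovery described in this quotation reads roughly as follows; writing each instance as the pair x_i = (s_i, t_i) is an assumption made explicit here, not something stated in the excerpt.

```latex
% Classification view of link discovery, restated from the excerpt above.
% Assumption: x_i denotes the source/target pair (s_i, t_i); rho is the
% relation whose links are to be discovered.
\[
  X = S \times T, \qquad Y = \{+1, -1\}, \qquad
  f(x_i) =
  \begin{cases}
    +1 & \text{if } \rho(s_i, t_i) \text{ holds},\\
    -1 & \text{otherwise.}
  \end{cases}
\]
```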
“…These approaches are often adapted to some set of data. In some cases, this type of rule is automatically discovered [14,3,2,7]. Other approaches are based on logical rules [8,17] that are generated automatically using the semantics of the keys or functional properties.…”
Section: State of the Art (mentioning)
confidence: 99%
“…Standard blocking approaches were implemented in the first versions of SILK and later replaced with MultiBlock [9], a lossless multidimensional blocking technique. KnoFuss [22] also implements blocking techniques to achieve acceptable runtimes. Further LD frameworks have participated in the Ontology Alignment Evaluation Initiative [4].…”
Section: Related Work (mentioning)
confidence: 99%
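As a rough illustration of the blocking idea referred to in this excerpt (compare only entities that share a cheap blocking key, rather than all source/target pairs), here is a minimal sketch; it is not how SILK's MultiBlock or KnoFuss implement blocking, and all names are hypothetical.

```python
from collections import defaultdict

# Minimal sketch of standard blocking: group records by a cheap blocking key
# so that expensive similarity comparisons run only within each block.
# Illustrative only; not the MultiBlock or KnoFuss implementation.


def blocking_key(record: dict) -> str:
    """A simple key: first three characters of the name, lower-cased."""
    return record.get("name", "")[:3].lower()


def candidate_pairs(sources, targets):
    """Yield only those (source, target) pairs that share a blocking key."""
    index = defaultdict(list)
    for t in targets:
        index[blocking_key(t)].append(t)
    for s in sources:
        for t in index[blocking_key(s)]:
            yield s, t


if __name__ == "__main__":
    sources = [{"name": "Jaguar XK"}, {"name": "Berlin"}]
    targets = [{"name": "Jaguar X-K"}, {"name": "Paris"}]
    for s, t in candidate_pairs(sources, targets):
        print(s["name"], "<->", t["name"])
```

The point of such a scheme is that the number of expensive similarity comparisons drops from |S|·|T| to the sum of the block sizes, at the cost that simple blocking keys may miss matches whose keys differ (which is what the "lossless" property of MultiBlock, as described in the excerpt, avoids).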