Md-Mizanur RAHOMAN†a) , Nonmember and Ryutaro ICHISE †, † †b) , Member SUMMARY Keyword-based linked data information retrieval is an easy choice for general-purpose users, but the implementation of such an approach is a challenge because mere keywords do not hold semantic information. Some studies have incorporated templates in an effort to bridge this gap, but most such approaches have proven ineffective because of inefficient template management. Because linked data can be presented in a structured format, we can assume that the data's internal statistics can be used to effectively influence template management. In this work, we explore the use of this influence for template creation, ranking, and scaling. Then, we demonstrate how our proposal for automatic linked data information retrieval can be used alongside familiar keyword-based information retrieval methods, and can also be incorporated alongside other techniques, such as ontology inclusion and sophisticated matching, in order to achieve increased levels of performance. key words: linked data, keyword, information access, data statistics
IntroductionThe Linked Open Data [4] initiative, where data are connected in a network-like structure [7] and which was motivated by the potential for link construction and identification among various data, has opened new worlds in data usage. The concept of this storage paradigm deviates from traditional repository-centric infrastructures to an open publishing model that allows other applications to access and interpret stored data [16]. As of September 2011, 295 knowledge-bases consisting of over 31 billion resource document framework (RDF) triples on various domains, have become interlinked via approximately 504 million RDF links * . Efficient and easy-to-use information access over linked data is now a necessity because these days such linked data hold vast amounts of knowledge. Usually, obtaining information access over a linked data network requires following links [2], [3], [15]. However, simply following links introduces a very basic problem, which is that the use of a network presentation makes it very hard to find endpoints, at least within a reasonable cost [1]. As a result, finding links on linked data is often difficult, especially for general-purpose users who have very little knowledge about the internal structure of linked data, such as schema infor- . However, such data access options are different from other traditional keyword-based data retrieval types because they require adapting keywords to semantics. Since keywords do not contain semantic information (specifically ontology information), a number of researchers have proposed automatic ontology inclusion [28] to bridge that gap. However, automatic ontology inclusion is a challenge because, in such cases, the system itself needs to incorporate ontology that is, as yet, unavailable.In attempts to resolve the abovementioned problems, such as link finding and keyword semantics inclusion, recently, a number of other researchers have worked to incorpora...