Abstract. SHIRI1 is an ontology-based system for integration of semistructured documents related to a specific domain. The system's purpose is to allow users to access to relevant parts of documents as answers to their queries. SHIRI uses RDF/OWL for representation of resources and SPARQL for their querying. It relies on an automatic, unsupervised and ontology-driven approach for extraction, alignment and semantic annotation of tagged elements of documents. In this paper, we focus on the Extract-Align algorithm which exploits a set of named entity and term patterns to extract term candidates to be aligned with the ontology. It proceeds in an incremental manner in order to populate the ontology with terms describing instances of the domain and to reduce the access to extern resources such as Web. We experiment it on a HTML corpus related to call for papers in computer science and the results that we obtain are very promising. These results show how the incremental behaviour of Extract-Align algorithm enriches the ontology and the number of terms (or named entities) aligned directly with the ontology increases.
Abstract. This paper presents a sociocultural knowledge ontology (OntoSOC) modeling approach. Onto-SOC modeling approach is based on Engeström"s Human Activity Theory (HAT). That Theory allowed us to identify fundamental concepts and relationships between them. The top-down precess has been used to define differents sub-concepts. The modeled vocabulary permits us to organise data, to facilitate information retrieval by introducing a semantic layer in social web platform architecture, we project to implement. This platform can be considered as a « collective memory » and Participative and Distributed Information System (PDIS) which will allow Cameroonian communities to share an co-construct knowledge on permanent organized activities.
Data integration involves combining data residing in different sources and providing users with a unified view of these data through what is called a "global schema''. The authors address here the problem of the construction of this global schema, with a minimum human effort, in the semantic Web context where data sources are annotated with ontologies. The authors aim to facilitate the task of building a common vocabulary (ontology) that will serve as a shared conceptual level for several heterogeneous data sources needing to share their data in a specific application domain. The authors propose a solution based on the use of a domain reference ontology (or "background knowledge'') as a mediation support and some reasoning techniques in order to minimize human intervention. The work presented here is implemented as a prototype for Semi Automatic Global Ontology Building (SAGOB).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.