Ontology alignment plays a critical role in knowledge integration and has been widely investigated in the past decades. State of the art systems, however, still have considerable room for performance improvement especially in dealing with new (industrial) alignment tasks. In this paper we present a machine learning based extension to traditional ontology alignment systems, using distant supervision for training, ontology embedding and Siamese Neural Networks for incorporating richer semantics. We have used the extension together with traditional systems such as LogMap and AML to align two food ontologies, HeLiS and FoodOn, and we found that the extension recalls many additional valid mappings and also avoids some false positive mappings. This is also verified by an evaluation on alignment tasks from the OAEI conference track.
Ontology Matching (OM) plays an important role in many domains such as bioinformatics and the Semantic Web, and its research is becoming increasingly popular, especially with the application of machine learning (ML) techniques. Although the Ontology Alignment Evaluation Initiative (OAEI) represents an impressive effort for the systematic evaluation of OM systems, it still suffers from several limitations including limited evaluation of subsumption mappings, suboptimal reference mappings, and limited support for the evaluation of ML-based systems. To tackle these limitations, we introduce five new biomedical OM tasks involving ontologies extracted from Mondo and UMLS. Each task includes both equivalence and subsumption matching; the quality of reference mappings is ensured by human curation, ontology pruning, etc.; and a comprehensive evaluation framework is proposed to measure OM performance from various perspectives for both ML-based and non-ML-based OM systems. We report evaluation results for OM systems of different types to demonstrate the usage of these resources, all of which are publicly available as part of the new Bio-ML track at OAEI 2022.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.