The paper introduces a new solution for semantic analysis implementation in modern enterprise content management (ECM) systems. The system of semantic analysis is intended for the intellectual analysis of enterprise official and technical documents based on machine learning, namely the extraction of the specified attributes from them for further use. In this paper it is proposed to implement semantic search using the extracted data configurator, which is responsible for creating and managing ontologies. From the configurator of the extracted data by the name of the document type, a graph is generated containing attributes to be extracted (official terms and sections, dates, etc.), regular expressions to search for sentences that probably contain the desired attribute, Yargy and regular rules for extracting attributes from the arrays of sentences. The proposed solution was successfully probated and tested on a dataset containing engineering enterprise contract agreements and protocols.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.