Automatic correction is an important research field in natural language processing, it is still relatively weak in the text automatic correction technology on semantic level. This paper presents develop XML rules based on LanguageTool Chinese grammar correction, match error correction in semantic level for the content of the rule to write the corresponding rule base, and realize automatic detect the corresponding content of semantic error in the text, and put forward the corresponding modification suggestions and opinions. The experiments show a high correct rate that the process of correction the corpus, which shows that the research and implementation of semantic of Chinese text is very practical significance and valuable. The introduction Background and significance of research. Text is an important carrier of human social information. With the rapid development of whole society information process, the importance and urgency of text information correction is more and more obvious. The former researchers in the text of the technology has made great achievements, but their check wrong technology just based on the word level, dealing with more words, less word or wrong character correction in the text. If testing an essay about the text of population statistics in China, the correct expression is "我国人口统计(不 包括台湾、香港、澳门)"("the population statistics of our country (excluding Taiwan, Hongkong, Macao") but appear the "大陆人口统计"("the population statistics of mainland") in the text. If do not found this mistake and report it out, which has violated China's political class information what appeared semantic collocation error. It is difficult to estimate the speed of network transmission, which will bring many negative effects to the society. Semantic errors, text automatic error-detection by word level is unable to achieve the error correction on semantic level, so the research and implementation of Chinese text semantic correction is a key and an essential part of Chinese text automatic correction development. The research status at foreign. In the early 1960s, foreign carried out the study of English text automatic correction; development to today, its technology is very mature already. Because of the English text use the spaces between words and words for separators, their automatic correction is the core of the word, word-error can be of two types, one is non-word error, the other is real-word error [1]. The study found that the non-word errors in the English text accounted for 60%, the real-word error accounted for 40% [2]. The non-word errors that the string is not exist in dictionary [3] ; Real-word error that string is exist in the dictionary [3] , but it with the context collocation error, which cause syntactic semantic error, so the real-word error is the semantic error. Traditional real-word error detection method mainly has two types: based on the traditional method of natural language processing and the method based on statistical language model [4]. It has been studying and improving the technology of semanti...
Ontology construction is a hot topic in semantic research.This paper proposes constructing the domain ontology of political sensitive Information with Political Information as knowledge source data.By the using of ontology itself contains the description logic consistency reasoning verification mechanism, the constructed ontology library can be applied to multiple fields, such as performance evaluation, marking system, semantic network retrieval semantic proofreading and so on. It shows that the research and implementation of the construction of sensitive information ontology based on Protege and semantic reasoning has important practical significance and extensive application prospect. ① abroad, according to ontology construction method chanting time to sort are : IDEF4 method;
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.