2021
DOI: 10.2478/dim-2021-0005
|View full text |Cite
|
Sign up to set email alerts
|

A Pattern and POS Auto-Learning Method for Terminology Extraction from Scientific Text

Abstract: A lot of new scientific documents are being published on various platforms every day. It is more and more imperative to quickly and efficiently discover new words and meanings from these documents. However, most of the related works rely on labeled data, and it is quite difficult to deal with unlabeled new documents efficiently. For this, we have introduced an unsupervised method based on sentence patterns and part of speech (POS) sequences. Our method just needs a few initial learnable patterns to obtain the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
3
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 9 publications
0
3
0
Order By: Relevance
“…The creation of methods for extracting terms using unsupervised learning models is one of the topical contemporary issues. For example, in [19] the authors propose unsupervised technology for extracting concepts, while considering the context of using the concept in terms of parts of speech (POS), which makes it possible to make a model that is universal for different subject areas.…”
Section: Concepts and Relationships Extraction From Natural Language ...mentioning
confidence: 99%
“…The creation of methods for extracting terms using unsupervised learning models is one of the topical contemporary issues. For example, in [19] the authors propose unsupervised technology for extracting concepts, while considering the context of using the concept in terms of parts of speech (POS), which makes it possible to make a model that is universal for different subject areas.…”
Section: Concepts and Relationships Extraction From Natural Language ...mentioning
confidence: 99%
“…to match in the corpus and output multicharacter units that meet the established rules as terms. The common term extraction models mainly focus on language features 8 , 9 , syntactic patterns 10 – 12 , and retrieval strategies 13 . The advantages of the method include being concise, intuitive, and having a strong expressive ability.…”
Section: Theoretical Backgroundmentioning
confidence: 99%
“…The paper "A Pattern and POS Auto-Learning Method for Terminology Extraction from Scientific Text" (Shao, Hua, & Song, 2021) proposed an unsupervised method based on sentence patterns and part of speech (POS) sequences extracted from scientific texts. The proposed method only requires a few initial learnable patterns to obtain initial terminological tokens and their POS sequences.…”
mentioning
confidence: 99%