2021
DOI: 10.1007/s40747-021-00343-8
|View full text |Cite
|
Sign up to set email alerts
|

A patent keywords extraction method using TextRank model with prior public knowledge

Abstract: For large amount of patent texts, how to extract their keywords in an unsupervised way is a very important problem. In existing methods, only the own information of patent texts is analyzed. In this study, an improved TextRank model is proposed, in which prior public knowledge is effectively utilized. Specifically, two following points are first considered: (1) a TextRank network is constructed for each patent text, (2) a prior knowledge network is constructed based on public dictionary data, in which network … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 23 publications
(7 citation statements)
references
References 31 publications
(33 reference statements)
0
6
0
Order By: Relevance
“…The supporting information for the network of keywords is shown in Table 7. Every keyword is represented by a node, where the size of the node depicts its occurrence (Huang and Xie, 2022). It means the higher the size of the node, the greater the occurrences of the keyword.…”
Section: Bibliometric Analysis and Resultsmentioning
confidence: 99%
“…The supporting information for the network of keywords is shown in Table 7. Every keyword is represented by a node, where the size of the node depicts its occurrence (Huang and Xie, 2022). It means the higher the size of the node, the greater the occurrences of the keyword.…”
Section: Bibliometric Analysis and Resultsmentioning
confidence: 99%
“…TextRank algorithm [13] , derived from PageRank, is a typical graph-based keyword extraction method. It extracts keywords by using co-occurrence information between words in documents, and extracts key sentences of the text by using extractive automatic abstracting method.…”
Section: Keyword Extractionmentioning
confidence: 99%
“…However, it is poorly effective for new words and words that are not included on Wikipedia. Huang Z [10] constructed a prior knowledge network by using public dictionary data and improved the scoring function in the TextRank network by using some prior relations in this network. Kazemi A [11] optimized the state transfer matrix by randomly resetting the weights during iterations of the TextRank graph, which improved the effect of the keyword extraction.…”
Section: Related Workmentioning
confidence: 99%
“…The final node scores are only related to the state transfer matrix between nodes rather than the initial values of nodes. The final state transfer matrix of the nodes in the optimized word graph is shown in Formula (10), where P ij denotes the probability of node v i transferring to v j . n is the number of nodes.…”
Section: F Word Features A) P Cover V I V J : Textrank Initial Node ...mentioning
confidence: 99%