2021
DOI: 10.1049/cje.2021.05.007
|View full text |Cite
|
Sign up to set email alerts
|

Keyword Extraction from Scientific Research Projects Based on SRP‐TF‐IDF

Abstract: Keyword extraction by Term frequency-Inverse document frequency (TF-IDF) is used for text information retrieval and mining in many domains, such as news text, social contact text, and medical text. However, keyword extraction in special domains still needs to be improved and optimized, particularly in the scientific research field. The traditional TF-IDF algorithm considers only the word frequency in documents, but not the domain characteristics. Therefore, we propose the Scientific research project TF-IDF (SR… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
8
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 23 publications
(8 citation statements)
references
References 17 publications
0
7
0
Order By: Relevance
“…TF-IDF algorithm is a classic text feature weighting method, which measures the importance of a word in a document [23,24]. Equation ( 3) is the formula of TF-IDF algorithm, where TFðw, dÞ is the word frequency of word w in text d, N is the total number of documents in corpus, and DFðwÞ is the number of documents containing word w in the training corpus.…”
Section: Methodsmentioning
confidence: 99%
“…TF-IDF algorithm is a classic text feature weighting method, which measures the importance of a word in a document [23,24]. Equation ( 3) is the formula of TF-IDF algorithm, where TFðw, dÞ is the word frequency of word w in text d, N is the total number of documents in corpus, and DFðwÞ is the number of documents containing word w in the training corpus.…”
Section: Methodsmentioning
confidence: 99%
“…The fundamental idea behind TF-IDF is that if a word appears frequently in a document but less frequently in other documents, it carries more significance in distinguishing the document and expressing its core content. Hen2vecce, such words are assigned higher weights [ 34 ]. The TF-IDF algorithm calculates a weight for each word in a document based on the term frequency (TF) and inverse document frequency (IDF).…”
Section: Methodsmentioning
confidence: 99%
“…LSTM recurrent calculations are then performed, and the basic LSTM function is shown in Equation 4. Next, softmax calculations are performed based on the input gates of characters and words to obtain weighted coefficients, followed by weighted summation, laying the foundation for subsequent training [24].…”
Section: ) Entity Extractionmentioning
confidence: 99%