2012
DOI: 10.1080/17517575.2012.665945
|View full text |Cite
|
Sign up to set email alerts
|

Term frequency – function of document frequency: a new term weighting scheme for enterprise information retrieval

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2013
2013
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 13 publications
(5 citation statements)
references
References 33 publications
0
5
0
Order By: Relevance
“…For the calculation method of word frequency features, researchers mainly use the classical TF-IDF (word frequency-inverse document frequency) algorithm [19,24,27], because TF-IDF will give higher weights to some common words, which will lead to lower weights of some burst words, research scholars have improved the TF-IDF algorithm [36,37], and some other researchers have proposed the DF-IDF (document frequencyinverse document frequency) algorithm [28] to make up for the defects of TF-IDF. The word frequency calculation method mentioned above is applied only to a single data source, and for the problem of multiple data sources, Bun et al proposed a novel TF*PDF (Term Frequency * Proportional Document Frequency) algorithm [39,40]. The algorithm describes that whenever a popular topic is being discussed, that topic is frequently discussed in numerous news documents from most news sources.…”
Section: Word Frequency Featuresmentioning
confidence: 99%
“…For the calculation method of word frequency features, researchers mainly use the classical TF-IDF (word frequency-inverse document frequency) algorithm [19,24,27], because TF-IDF will give higher weights to some common words, which will lead to lower weights of some burst words, research scholars have improved the TF-IDF algorithm [36,37], and some other researchers have proposed the DF-IDF (document frequencyinverse document frequency) algorithm [28] to make up for the defects of TF-IDF. The word frequency calculation method mentioned above is applied only to a single data source, and for the problem of multiple data sources, Bun et al proposed a novel TF*PDF (Term Frequency * Proportional Document Frequency) algorithm [39,40]. The algorithm describes that whenever a popular topic is being discussed, that topic is frequently discussed in numerous news documents from most news sources.…”
Section: Word Frequency Featuresmentioning
confidence: 99%
“…A framework for data mining of key safety factors from safety-related cases of consumer products is proposed and depicted in Figure 1. Data mining theory, methods and algorithms have been discussed by many authors in the literature (Wilamowski et al, 1999;Wilamowski and Kaynak, 2000;Li and Xu, 2001;Li et al, 2003Li et al, , 2009Li et al, , 2013aLi et al, , 2013bDuan et al, 2007Duan et al, ,2009Shi et al, 2007;Hewlett et al, 2008;Xu et al, 2008;Wilamowski, 2010;Duan and Xu, 2012;Fritzsche et al, 2012;Hunter et al, 2012;Ingvaldsen and Gulla, 2012;Xu et al, 2012;Yang et al, 2012;Yu et al, 2012;Zhang et al, 2012;Bulysheva and Bulyshev, 2013;Katayev et al, 2013;Wang et al, 2013b;Xia et al, 2013;Xing et al, 2013;Zeng et al, 2013aZeng et al, , 2013b. The functional modules in the framework are classified into three phases, that is, cases collection, extraction of impact factors and retrieving of key factors by knowledge reasoning based on the Bayesian network.…”
Section: A Data Mining Frameworkmentioning
confidence: 99%
“…In an effort to solve this issue, we have used the latest eye‐tracking technology in our study to research how different forms of ads affect the attention of streaming media advertisement. The eye‐tracking technology has been frequently used in information studies (Burns and Lutz, ; Moore et al ., ; Beynon‐Davies, ; Yang et al ., ; Zhang et al ., ).…”
Section: Introductionmentioning
confidence: 97%