Encyclopedia of Machine Learning 2011
DOI: 10.1007/978-0-387-30164-8_832
|View full text |Cite
|
Sign up to set email alerts
|

Tf–idf

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
0
2

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
4
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 84 publications
(25 citation statements)
references
References 0 publications
0
23
0
2
Order By: Relevance
“…For building conventional ML models, we chose Random Forest (RF) [2] and Support Vector Machine (SVM) [41] as candidate models since these models are observed to be superior performers in the classification of short texts like social media comments [35]. The features that we used in these model building are Regex status and sentiment features along with TF-IDF Vectorizer [33]. The sentiment-related features used in our work are sentiment polarity at the comment level, which can be positive, negative or neutral, and polarity score, which indicates how strongly the sentiment has been expressed in the comment.…”
Section: Ml-based Classificationmentioning
confidence: 99%
“…For building conventional ML models, we chose Random Forest (RF) [2] and Support Vector Machine (SVM) [41] as candidate models since these models are observed to be superior performers in the classification of short texts like social media comments [35]. The features that we used in these model building are Regex status and sentiment features along with TF-IDF Vectorizer [33]. The sentiment-related features used in our work are sentiment polarity at the comment level, which can be positive, negative or neutral, and polarity score, which indicates how strongly the sentiment has been expressed in the comment.…”
Section: Ml-based Classificationmentioning
confidence: 99%
“…For each evaluator, we first collect the feature descriptions of all 115 clone groups that were resolved as valid in the intra-clone group validation phase. To reduce manual effort and chances of incurring human error while analyzing all 115 2 = 6555 combinations of feature descriptions, we form a subset of the clone group descriptions of each evaluator based on TF-IDF [114] similarity. After stemming all words in the descriptions, we calculate pair-wise similarity between all feature descriptions of an evaluator using a TF-IDF similarity score.…”
Section: Inter-clone Group Dissimilarity Validationmentioning
confidence: 99%
“…Then obtained: Centroid 1=0.3 and Centroid 2=0.3 as explained in Table 5 and the visualization explained in Figure 2. Next calculate the distance of each data with each weight using (1) [24,25]:…”
Section: Figure 1 Tf-idf Data On Graphmentioning
confidence: 99%