IEC2017 Proceedings Book 2017
DOI: 10.23918/iec2017.16
|View full text |Cite
|
Sign up to set email alerts
|

Improving TF-IDF with Singular Value Decomposition (SVD) for Feature Extraction on Twitter

Abstract: Feature extraction is provided a lot of significance in social networks such as Twitter, due to playing a vital role in public opinion analysis. Several algorithms are suggested for solving them. Feature extractions are generally defined as to the process of extracting interesting features, non-trivial and knowledge from unstructured text documents. Feature extractions are interdisciplinary field which depends on information retrieval, machine learning, parameter statistics and computational linguistics. This … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(3 citation statements)
references
References 9 publications
0
3
0
Order By: Relevance
“…Tokenization describes interpreting and grouping isolated tokens to generate higher-level tokens in addition to dividing strings into fundamental processing units. Word tokenization includes preprocessed raw texts are divided into textual units 18,19 . During the dataset cleaning stage, the columns from the datasets that weren't needed for processing were removed.…”
Section: Data Pre-processingmentioning
confidence: 99%
“…Tokenization describes interpreting and grouping isolated tokens to generate higher-level tokens in addition to dividing strings into fundamental processing units. Word tokenization includes preprocessed raw texts are divided into textual units 18,19 . During the dataset cleaning stage, the columns from the datasets that weren't needed for processing were removed.…”
Section: Data Pre-processingmentioning
confidence: 99%
“…As shown in [18], dimension reduction to TF-IDF can be achieved using Singular Value Decomposition (SVD) which is a dimensionality reduction method that works well with sparse matrices. Equation 4 shows how SVD is calculated.…”
Section: Feature Extractionmentioning
confidence: 99%
“…This diagonal matrix has the square root of eigenvalues of V or U. The SVD offers a strong mathematical foundation for the field of text analytics [3]. In SVD, the matrix A represents the text document as a high dimension vector space model.…”
Section: A Singular Value Decompositionmentioning
confidence: 99%