2020
DOI: 10.1155/2020/6076272
|View full text |Cite
|
Sign up to set email alerts
|

A New Feature Selection Method for Text Classification Based on Independent Feature Space Search

Abstract: Feature selection method is designed to select the representative feature subsets from the original feature set by different evaluation of feature relevance, which focuses on reducing the dimension of the features while maintaining the predictive accuracy of a classifier. In this study, we propose a feature selection method for text classification based on independent feature space search. Firstly, a relative document-term frequency difference (RDTFD) method is proposed to divide the features in all text docum… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 22 publications
(14 citation statements)
references
References 31 publications
0
14
0
Order By: Relevance
“…The latest advances in feature selection are a combination of feature selection with deep learning especially the Convolutional Neural Networks (CNN) for classification tasks, such as applications in bioinformatics neurodegenerative disorders classification using the Principal Components Analysis (PCA) algorithm [112,113], brain tumor segmentation [114] using three planar super pixel based statistical and textural features extraction. Next, remote sensing imagery classification using a fusion of CNN and RF [115], and software fault prediction [116] using enhanced binary moth flame optimization as a feature selection, and text classification based on independent feature space search [117].…”
Section: Evaluation Performance and Discussionmentioning
confidence: 99%
“…The latest advances in feature selection are a combination of feature selection with deep learning especially the Convolutional Neural Networks (CNN) for classification tasks, such as applications in bioinformatics neurodegenerative disorders classification using the Principal Components Analysis (PCA) algorithm [112,113], brain tumor segmentation [114] using three planar super pixel based statistical and textural features extraction. Next, remote sensing imagery classification using a fusion of CNN and RF [115], and software fault prediction [116] using enhanced binary moth flame optimization as a feature selection, and text classification based on independent feature space search [117].…”
Section: Evaluation Performance and Discussionmentioning
confidence: 99%
“…We selected words based on their ability to discriminate between T and C, via a combination of filter and embedded methods [56-60] resulting in 41,664 words (Figure 2). The T document vectors were reduced to include only the 41,664 words (see Figure 1.e for a truncated example).…”
Section: Methodsmentioning
confidence: 99%
“…Peng & Fan [52] 2017 By optimizing lower bound of conditional mutual information SFR [54] 2018 Uses subspace feature clustering to identify feature clusters CFS [55] 2018 Similar to MRMR and uses composition of feature relevancy Wang et al [59] 2019 Uses rough set theory based relative neighborhood self-information on both lower and upper approximations. PRFS [60] 2020 Proportional Rough Feature Selection based on rough set for regional distinction Liu et al [61] 2020 Independent feature space search using relative doc-term frequency difference for class correlation and redundancy Hossny et al [62] 2020 Uses text mining specifics e.g., word count, word forms such as n-gram, skip-gram, etc. Gao et al [65] 2020 min-redundancy and max-dependency (MRMD) using relevancy with a class given selected features…”
Section: Selection Methods Year Key Idea/advantage/applicationmentioning
confidence: 99%