With the advent of computers and the information age, statistical and analytical problems have grown in terms of both size and complexity. Challenges in core domains of data storage, organisation and searching have evolved to the new research field called data mining. Text classification using various machine learning (ML) mechanisms encounters the difficulty of the high dimensionality of attributes vector. Therefore, a feature selection technique is very much required to discard irrelevant as well as noisy attributes from the feature set vector so that the ML algorithms can work efficiently. In this paper, rough set theory (RST)-based attribute selection methodology is applied to achieve text classification goal. A hybrid method based on RST is proposed for text documents classification. Further, the proposed method’s performance is evaluated on standard datasets.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.