Text categorization (TC) is one of the main applications of machine learning. Many methods have been proposed, such as the Rocchio method, naive Bayes based methods, and SVM-based text classification methods.
These methods learn from labeled text documents and then construct a classifier, which can predict the category of a new text document. However, these methods do not give a description of each category. In the machine learning field, there are many concept learning algorithms, such as ID3 and CN2. This paper proposes a more robust algorithm to induce concepts from training examples, which is based on enumeration of all possible keyword combinations. Experimental results show that the rules produced by our approach are more precise and simpler than those of other methods.
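The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea of inducing category rules by enumerating keyword combinations, not the paper's method. The function name `induce_rules`, the parameters `max_len` and `min_precision`, the pruning of supersets of accepted rules, and the toy documents are all assumptions made for illustration.

```python
from itertools import combinations


def induce_rules(docs, labels, target, max_len=2, min_precision=1.0):
    """Enumerate keyword combinations and keep those that predict `target`.

    docs   -- list of sets of keywords (one set per training document)
    labels -- list of category labels, parallel to docs
    Returns a list of (keywords, precision, coverage) tuples.
    """
    vocab = sorted(set().union(*docs))
    rules = []
    # Shorter combinations are tried first, so simpler rules are preferred.
    for k in range(1, max_len + 1):
        for combo in combinations(vocab, k):
            combo_set = set(combo)
            # Assumed pruning step: skip combinations that only extend
            # a rule that was already accepted.
            if any(set(r) <= combo_set for r, _, _ in rules):
                continue
            # Labels of all documents containing every keyword in combo.
            covered = [lab for doc, lab in zip(docs, labels) if combo_set <= doc]
            if not covered:
                continue
            precision = sum(lab == target for lab in covered) / len(covered)
            if precision >= min_precision:
                rules.append((combo, precision, len(covered)))
    return rules


# Toy training set (hypothetical data).
docs = [
    {"team", "win"},
    {"team", "win", "coach"},
    {"team", "market"},
    {"win", "market"},
]
labels = ["sport", "sport", "finance", "finance"]

rules = induce_rules(docs, labels, "sport")
for keywords, precision, coverage in rules:
    print(" AND ".join(keywords), "=> sport", precision, coverage)
```

In this toy data neither "team" nor "win" alone identifies the sport category, but their conjunction does, which is the kind of readable category description that a plain classifier does not provide. Exhaustive enumeration grows combinatorially with vocabulary size, so a practical version would need a bound such as `max_len` or some other pruning.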