Many techniques and algorithms for automatic text categorization had been devised and proposed in the literature. However, there is still much space for researchers in this area to improve existing algorithms or come up with new techniques for text categorization (TC). Polynomial Networks (PNs) were never used before in TC. This can be attributed to the huge datasets used in TC, as well as the technique itself which has high computational demands. In this paper, we investigate and propose using PNs in TC. The proposed PN classifier has achieved a competitive classification performance in our experiments. More importantly, this high performance is achieved in one shot training (noniteratively) and using just 0.25%-0.5% of the corpora features. Experiments are conducted on the two benchmark datasets in TC: Reuters-21578 and the 20 Newsgroups. Five well-known classifiers are experimented on the same data and feature subsets: the state-of-the-art Support Vector Machines (SVM), Logistic Regression (LR), the k-nearest-neighbor (kNN), Naive Bayes (NB), and the Radial Basis Function (RBF) networks.
Several Text Categorization (TC) techniques and algorithms have been investigated in the limited research literature of Arabic TC. In this research, Logistic Regression (LR) is investigated in Arabic TC. To the best of our knowledge, LR was never used for Arabic TC before. Experiments are conducted on Aljazeera Arabic News (Alj-News) dataset. Arabic text-preprocessing takes place on this dataset to handle the special nature of Arabic text. Experimental results of this research prove that the LR classifier is a competitive Arabic TC algorithm to the state of the art ones in this field; it has recorded a precision of 96.5% on one category and above 90% for 3 categories out of the five categories of Alj-News dataset. Regarding the overall performance, LR has recorded a macroaverage precision of 87%, recall of 86.33% and Fmeasure of 86.5%.
In this paper, the role of rare or infrequent terms in enhancing the accuracy of English Text Categorization using Polynomial Networks (PNs) is investigated. To study the impact of rare terms in enhancing the accuracy of PNs-based text categorization, different term reduction criteria as well as different term weighting schemes were experimented on the Reuters Corpus using PNs. Each term weighting scheme on each reduced term set was tested once keeping the rare terms and another time removing them. All the experiments conducted in this research show that keeping rare terms substantially improves the performance of Polynomial Networks in Text Categorization, regardless of the term reduction method, the number of terms used in classification, or the term weighting scheme adopted.
Many Text Classification (TC) algorithms have been proposed for Arabic TC. Polynomial Neural Networks (PNNs) were used recently in English TC, and have proved to be competitive to the state of the art text classifiers in this field. Lately, they were proposed for classifying Arabic documents. In this research paper, an experimental study that directly compares PNNs against five famous classification algorithms in TC is conducted on Aljazeera-News Arabic dataset. All experiments use the same TC settings, like preprocessing, Feature Selection (FS) and reduction criteria, feature weighting and classifier performance evaluation measures. These algorithms are: SVM (Support Vector Machines), NB (Naive Bayes), kNN (k-Nearest-Neighbor), LR (Logistic Regression) and RBF (Radial Basis Function networks). Results reached in this study reveal that PNN are competitive classifiers in the field of Arabic TC.
The congestion problem at the router buffer leads to serious consequences on network performance. Active Queue Management (AQM) has been developed to react to any possible congestion at the router buffer at an early stage. The limitation of the existing fuzzy-based AQM is the utilization of indicators that do not address all the performance criteria and quality of services required. In this paper, a new method for active queue management is proposed based on using the fuzzy logic and multiple performance indicators that are extracted from the network performance metrics. These indicators are queue length, delta queue and expected loss. The simulation of the proposed method show that in high traffic load, the proposed method preserves packet loss, drop packet only when it is necessary and produce a satisfactory delay that outperformed the state-of-the-art AQM methods.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.