This paper presents a novel method to enhance the performance of Clustering-based Autoencoder models for network anomaly detection. Previous studies have developed regularized variants of Autoencoders to learn the latent representation of normal data in a semi-supervised manner, including Shrink Autoencoder, Dirac Delta Variational Autoencoder and Clusteringbased Autoencoder. However, there are concerns regarding the feature selection of the original data, which stronger support Autoencoders models exploring more intrinsic, meaningful and latent features at bottleneck. The method proposed involves combining Principal Component Analysis and Clustering-based Autoencoder. Specifically, PCA is used for the selection of new data representation space, aiming to better assist CAE in learning the latent, prominent features of normal data, which addresses the aforementioned concerns. The proposed method is evaluated using the standard benchmark NSL-KDD data set and four scenarios of the CTU13 datasets. The promising experimental results confirm the improvements offered by the proposed approach, in comparison to existing methods. Therefore, it suggests a strong potential application within modern network anomaly detection systems.
Despite the continued advancements in security research, malware persists as being a major threat in this digital age. Malware detection is a primary defence strategy for most networks but the identification of malware strains is becoming increasingly difficult. Reliable identification is based upon characteristic features being detectable within an object. However, the limitations and expense of current malware feature extraction methods is significantly hindering this process. In this paper, we present a new method for identifying malware based on behavioural feature extraction. Our proposed method has been evaluated using seven classification methods whilst analysing 2,068 malware samples from eight different families. The results achieved thus far have demonstrated promising improvements over existing approaches.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.