Two types of control charts exist based on different quality characteristics: variable and attribute. These characteristics are commonly monitored using separate procedures. Only a few studies focused on the utilization of control charts to monitor a process with mixed characteristics. This study develops a new concept of the T 2 control chart based on a Principal Component Analysis (PCA) Mix, that is a PCA method that can jointly handle continuous and categorical data. The Kernel Density Estimation (KDE) method is used to estimate the control limit. Through simulation studies, the performance of the proposed chart is evaluated using the Average Run Length (ARL). T 2 control limits obtained from KDE produce a stable ARL 0 at~370 for α ¼ 0:00273: For the shifted process, the proposed chart demonstrates excellent performance for an appropriate number of principal components used. Applications of the simulated process and real cases show that the proposed chart is sensitive to monitoring the shifted process.
Statistical Process Control (SPC) has been widely used in industry and services. The SPC can be applied not only to monitor manufacture processes but also can be applied to the Intrusion Detection System (IDS). In network monitoring and intrusion detection, SPC can be a powerful tool to ensure system security and stability in a network. Theoretically, Hotelling's T 2 chart can be used in intrusion detection. However, there are two reasons why the chart is not suitable to be used. First, the intrusion detection data involves large volumes of high-dimensional process data. Second, intrusion detection requires a fast computational process so an intrusion can be detected as soon as possible. To overcome the problems caused by large number of quality characteristics, Principal Component Analysis (PCA) can be used. The PCA can reduce not only the dimension leading a faster computational, but also can eliminate the multicollinearity (among characteristic variables) problem. This paper is focused on the usage of multivariate control chart T 2 based on PCA for IDS. KDD99 dataset is used to evaluate the performance of the proposed method. Furthermore, the performance of T 2 based PCA will be compared with conventional T 2 control chart. The empirical results of this research show that the multivariate control chart using Hotelling's T 2 based on PCA has excellent performance to detect anomaly in network. Compared to conventional T 2 control chart, the T 2 based on PCA has similar performance with 97 percent hit rate. It also requires shorter computation time.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.