Kazuki Otomo scite author profile

System logs are useful for understanding the status of, and detecting faults in, large-scale networks. However, due to the diversity and volume of these logs, log analysis requires much time and effort. In this paper, we propose a log event anomaly detection method for large-scale networks without pre-processing and feature extraction. The key idea is to embed a large amount of diverse data into hidden states by using latent variables. We evaluate our method with 12 months of system logs obtained from a nationwide academic network in Japan. Through comparisons with Kleinberg's univariate burst detection and a traditional multivariate analysis (i.e., PCA), we demonstrate that our proposed method achieves 14.5% higher recall and 3% higher precision than PCA. A case study shows that the detected anomalies provide effective information for troubleshooting network system faults.
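For readers unfamiliar with the two metrics quoted in the abstract, the following is a minimal sketch of how precision and recall are commonly computed for a set of detected anomalies against a labeled ground truth. The event identifiers and the two detector outputs are invented for illustration; they are not data from the paper, and the function name `precision_recall` is hypothetical.

```python
def precision_recall(detected, labeled):
    """Return (precision, recall) for detected event IDs against labeled true anomalies."""
    detected, labeled = set(detected), set(labeled)
    true_positives = len(detected & labeled)
    # Precision: share of detections that are real anomalies.
    precision = true_positives / len(detected) if detected else 0.0
    # Recall: share of real anomalies that were detected.
    recall = true_positives / len(labeled) if labeled else 0.0
    return precision, recall


# Illustrative (made-up) ground truth and detector outputs.
labeled = {"e1", "e2", "e3", "e4"}
baseline_detected = {"e1", "e2", "e5"}
proposed_detected = {"e1", "e2", "e3", "e6"}

baseline = precision_recall(baseline_detected, labeled)
proposed = precision_recall(proposed_detected, labeled)
print("baseline precision/recall:", baseline)
print("proposed precision/recall:", proposed)
```

A method can raise recall by flagging more events, usually at the cost of precision, which is why the abstract reports both figures.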


A knowledge sharing system using XML Linking Language and peer-to-peer technology

Satō, Otomo, Masuo


Causal analysis of network logs with layered protocols and topology knowledge

Kobayashi, Otomo, Fukuda

2019


An Analysis of Burstiness and Causality of System Logs

Otomo, Kobayashi, Fukuda

et al. 2017


amulog: A General Log Analysis Framework for Diverse Template Generation Methods

Kobayashi, Yamashiro, Otomo

et al. 2020


amulog: A general log analysis framework for comparison and combination of diverse template generation methods

et al. 2021


Summary: One way to analyze unstructured log messages from large‐scale IT systems is to classify them with log templates generated by template generation methods. However, there is currently no common knowledge pertaining to the comparison and practical use of log template generation methods because they are implemented on the basis of diverse environments. To this end, we design and implement amulog, a general log analysis framework for comparing and combining diverse log template generation methods. Amulog consists of three key functions: (1) parsing log messages into headers and segmented messages, (2) classifying the log messages using a scalable template‐matching method, and (3) storing the structured data in a database. This framework helps us easily utilize time‐series data corresponding to the log templates for further analysis. We evaluate amulog with a log dataset collected from a nationwide academic network and demonstrate that it classifies the log data in a reasonable amount of time even with over 100,000 log template candidates. The template‐matching method in amulog also reduces the processing time for template generation by 75% while maintaining accuracy when combined with an existing structure‐based template generation method. To show the effectiveness of amulog in comparing log template generation methods, we compare the accuracy of six existing log template generation methods using 10 different accuracy metrics on amulog and demonstrate that the appropriate template generation methods and accuracy metrics largely depend on the purpose of further analysis.
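The three functions listed in the summary can be illustrated with a small standalone sketch. This is not amulog's actual code or API: the line format, the `**` wildcard notation, the table layout, and the function names `parse_line`, `match_template`, and `store` are all assumptions made for illustration, and the linear scan over templates is a simplification rather than the scalable matching method the summary refers to.

```python
import re
import sqlite3
from datetime import datetime

# Assumed line format: "<ISO timestamp> <host> <free-text message>".
LINE_RE = re.compile(r"^(\S+)\s+(\S+)\s+(.*)$")
WILDCARD = "**"  # assumed marker for a variable word in a template


def parse_line(line):
    """Step 1: split a raw line into header fields and a segmented message."""
    m = LINE_RE.match(line.strip())
    if m is None:
        return None
    timestamp, host, message = m.groups()
    return datetime.fromisoformat(timestamp), host, message.split()


def match_template(words, templates):
    """Step 2: return the index of the first matching template, or None."""
    for tid, tpl in enumerate(templates):
        if len(tpl) == len(words) and all(
            t == WILDCARD or t == w for t, w in zip(tpl, words)
        ):
            return tid
    return None


def store(conn, rows):
    """Step 3: persist (timestamp, host, template id) rows in a database."""
    conn.execute("CREATE TABLE IF NOT EXISTS log (ts TEXT, host TEXT, tid INTEGER)")
    conn.executemany("INSERT INTO log VALUES (?, ?, ?)", rows)
    conn.commit()


# Made-up templates and log lines for demonstration only.
templates = [
    ["Interface", WILDCARD, "changed", "state", "to", WILDCARD],
    ["Accepted", "password", "for", WILDCARD, "from", WILDCARD],
]
lines = [
    "2020-01-01T00:00:00 router1 Interface ge-0/0/1 changed state to down",
    "2020-01-01T00:00:05 router1 Interface ge-0/0/1 changed state to up",
    "2020-01-01T00:01:00 server2 Accepted password for alice from 192.0.2.10",
]

conn = sqlite3.connect(":memory:")
rows = []
for line in lines:
    parsed = parse_line(line)
    if parsed is None:
        continue
    ts, host, words = parsed
    rows.append((ts.isoformat(), host, match_template(words, templates)))
store(conn, rows)

# Per-template counts; grouping by time bucket would give the time series.
counts = conn.execute(
    "SELECT tid, COUNT(*) FROM log GROUP BY tid ORDER BY tid"
).fetchall()
print(counts)
```

Once rows are keyed by template ID and timestamp, per-template time series of the kind mentioned in the summary can be obtained with ordinary SQL aggregation.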


Finding Anomalies in Network System Logs with Latent Variables

Otomo, Kobayashi, Fukuda

et al. 2018


System logs are useful for understanding the status of, and detecting faults in, large-scale networks. However, due to the diversity and volume of these logs, log analysis requires much time and effort. In this paper, we propose a log event anomaly detection method for large-scale networks without pre-processing and feature extraction. The key idea is to embed a large amount of diverse data into hidden states by using latent variables. We evaluate our method with 15 months of system logs obtained from a nationwide academic network in Japan. Through comparisons with Kleinberg's univariate burst detection and a traditional multivariate analysis (i.e., PCA), we demonstrate that our proposed method detects anomalies and eases troubleshooting of network system faults.



Kazuki Otomo

Mining Causality of Network Events in Log Data

Latent Variable Based Anomaly Detection in Network System Logs
