2023
DOI: 10.1109/tnsm.2023.3239522
|View full text |Cite
|
Sign up to set email alerts
|

LogEncoder: Log-Based Contrastive Representation Learning for Anomaly Detection

Abstract: The increasing volume of log data produced by software-intensive systems makes it impractical to analyze them manually. Many deep learning-based methods have been proposed for log-based anomaly detection. These methods face several challenges such as high-dimensional and noisy log data, class imbalance, generalization, and model interpretability. Recently, ChatGPT has shown promising results in various domains. However, there is still a lack of study on the application of ChatGPT for log-based anomaly detectio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
4
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 14 publications
(6 citation statements)
references
References 56 publications
0
4
0
Order By: Relevance
“…Common problems encountered in log-based anomaly detection include the unstructured nature of log messages, each system having its own log characteristics, being less humanreadable, and containing many special or technical terms [22], [23]. To deal with these problems, a typical ML/DLbased log-based anomaly detection comprises several stages, i.e., log cleansing, log parsing, feature extraction, model training, model testing, and model evaluation.…”
Section: A Anomaly Detection On Logs Datamentioning
confidence: 99%
See 3 more Smart Citations
“…Common problems encountered in log-based anomaly detection include the unstructured nature of log messages, each system having its own log characteristics, being less humanreadable, and containing many special or technical terms [22], [23]. To deal with these problems, a typical ML/DLbased log-based anomaly detection comprises several stages, i.e., log cleansing, log parsing, feature extraction, model training, model testing, and model evaluation.…”
Section: A Anomaly Detection On Logs Datamentioning
confidence: 99%
“…Nevertheless, similar but contradicting log events ended up having features that are close to one another in the latent space. Dealing with such an issue, Qi et al [22] propose a contrastive-based approach which consists of a representation learning model to provide a decent input to a one-class classifier to distinguish the normal from the abnormal log samples. Instead of removing the parameter values in a log message when performing log parsing, the information can be used as an additional feature along with the metadata of the logs to improve the model's performance [19].…”
Section: A Anomaly Detection On Logs Datamentioning
confidence: 99%
See 2 more Smart Citations
“…Log parsing methods, such as Spell [23], IPLoM [24], Drain [25], and so on. According to previous studies [26], weak log parsing methods can negatively affect the performance of subsequent anomaly detection tasks.Drain has become a common method used by researchers with its superior accuracy and efficiency [27,28]. Therefore, in this work, we use Drain for log message parsing.As shown in Figure 2, the HDFS dataset is used as an example.…”
Section: Preprocessingmentioning
confidence: 99%