2020
DOI: 10.48550/arxiv.2012.08938
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Summarizing Unstructured Logs in Online Services

Weibin Meng,
Federico Zaiter,
Yuheng Huang
et al.

Abstract: Logs are one of the most valuable data sources for managing large-scale online services. After a failure is detected/diagnosed/predicted, operators still have to inspect the raw logs to gain a summarized view before take actions. However, manual or rule-based log summarization has become inefficient and ineffective. In this work, we propose LogSummary, an automatic, unsupervised end-to-end log summarization framework for online services. LogSummary obtains the summarized triples of important logs for a given l… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(6 citation statements)
references
References 27 publications
0
6
0
Order By: Relevance
“…We demonstrate the experimental results for log data summarization. We choose TextRank [41], a classic keyword extraction algorithm, and LogSummary [6] the state-of-the-art method design specifically to summarize log data. Table V reports the experimental results; it shows that the UniLog can significantly improve our log summarization performance: UniLog is improving the F1 score by 0.070 on BGL data, 0.031 on HPC data, and 0.036 on Proxifier over LogSummary.…”
Section: B Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…We demonstrate the experimental results for log data summarization. We choose TextRank [41], a classic keyword extraction algorithm, and LogSummary [6] the state-of-the-art method design specifically to summarize log data. Table V reports the experimental results; it shows that the UniLog can significantly improve our log summarization performance: UniLog is improving the F1 score by 0.070 on BGL data, 0.031 on HPC data, and 0.036 on Proxifier over LogSummary.…”
Section: B Resultsmentioning
confidence: 99%
“…In our case, it is necessary to keep the semantics presented in the logs for the output summary to be readable. Meng et al [6] present a log summarization dataset and designed a log summary framework by using the rule-based algorithm to produce a triplet for each log data. We extend their work to the deep learning approach, which is more general and practical for cross-service log data.…”
Section: B Log Analysismentioning
confidence: 99%
See 1 more Smart Citation
“…Another study by Meng et al [32] proposed an automated online streaming log model known as LogSummary, which is based on unsupervised algorithmic approaches and can be considered an end-to-end log summarization model that is far superior to that of the current heuristics' approaches. The study claimed that an efficient log parsing model should primarily consider three vital information-1) "entities"; 2) "events"; and 3) their "relationships."…”
Section: B Heuristics-based Log Parsingmentioning
confidence: 99%
“…The study claimed that an efficient log parsing model should primarily consider three vital information-1) "entities"; 2) "events"; and 3) their "relationships." The concept is stemmed from the idea of OpenIE to generate summarized triple-sets of all the vital log messages in a given log sequence [32]. The study avoided the ranking of log summaries that are traditional in OpenIE models and instead used "Log2Vec" to learn log semantics and train the word embedding representations.…”
Section: B Heuristics-based Log Parsingmentioning
confidence: 99%