Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 2013
DOI: 10.1145/2487575.2488209
|View full text |Cite
|
Sign up to set email alerts
|

An integrated framework for optimizing automatic monitoring systems in large IT infrastructures

Abstract: The competitive business climate and the complexity of IT environments dictate efficient and cost-effective service delivery and support of IT services. These are largely achieved by automating routine maintenance procedures, including problem detection, determination and resolution. System monitoring provides an effective and reliable means for problem detection. Coupled with automated ticket creation, it ensures that a degradation of the vital signs, defined by acceptable thresholds or monitoring conditions,… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2013
2013
2023
2023

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 25 publications
(5 citation statements)
references
References 29 publications
0
5
0
Order By: Relevance
“…LogSig [26] is a data mining based log parsing method that has been demonstrated in [47]. It parses logs through a threestep process: (1) word pair generation, (2) log clustering, and (3) log template generation.…”
Section: B Log Parsing Methods For Servers (Supercomputers) Distributed Systems and Applicationsmentioning
confidence: 99%
“…LogSig [26] is a data mining based log parsing method that has been demonstrated in [47]. It parses logs through a threestep process: (1) word pair generation, (2) log clustering, and (3) log template generation.…”
Section: B Log Parsing Methods For Servers (Supercomputers) Distributed Systems and Applicationsmentioning
confidence: 99%
“…In a number of real-world applications, such as healthcare and network security, it is crucial to reduce the Bayes risk (BR), i.e., the expected misclassification loss. For example, a typical case is to distinguish between the false positive errors and the false negative errors and treat them differently [55], [54].…”
Section: Related Workmentioning
confidence: 99%
“…Alternately, some research efforts, such as those in [37,38,39], have noted the importance of ticket correlation for incident resolution, claiming that the latter can be extended with advanced functions to enhance the incident resolution process, as the information in the tickets is related to incidents generated by events that have already been identified as network failures, and as such, some related alerts should exist. Other efforts, such as those in [40,41,42], use ITSs for several purposes, such as studying and characterizing the nature and causes of routing changes and the observed instability. In these references, the authors use simple ticket preprocessing operations to reduce the total number of tickets before correlating them.…”
Section: Related Workmentioning
confidence: 99%