2015 IEEE Conference on Computer Communications (INFOCOM) 2015
DOI: 10.1109/infocom.2015.7218516
|View full text |Cite
|
Sign up to set email alerts
|

Real-time failure prediction in online services

Abstract: Current data mining techniques used to create failure predictors for online services require massive amounts of data to build, train, and test the predictors. These operations are tedious, time consuming, and are not done in real-time. Also, the accuracy of the resulting predictor is highly compromised by changes that affect the environment and working conditions of the predictor. We propose a new approach to creating a dynamic failure predictor for online services in real-time and keeping its accuracy high du… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
7
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 17 publications
(7 citation statements)
references
References 8 publications
0
7
0
Order By: Relevance
“…Using log files for failure detection, diagnosis and detection has been widely applied in ISP networks [13], [16], computers [14], [17]- [21], [40], [41], and online ad services [42]. Liang et al investigated the RAS event logs and developed three simple failure prediction techniques based on not only the characteristics of failure events, but also the correlation between failure events and non-failure events [40].…”
Section: Related Workmentioning
confidence: 99%
“…Using log files for failure detection, diagnosis and detection has been widely applied in ISP networks [13], [16], computers [14], [17]- [21], [40], [41], and online ad services [42]. Liang et al investigated the RAS event logs and developed three simple failure prediction techniques based on not only the characteristics of failure events, but also the correlation between failure events and non-failure events [40].…”
Section: Related Workmentioning
confidence: 99%
“…Many studies have been conducted for finding anomalies and their root causes [2], [4], [5] in log data. Zhong et al [3] proposed an anomaly detection method for both device and network errors with fine log time series feature creation.…”
Section: Related Workmentioning
confidence: 99%
“…Many studies have been conducted for finding anomalies and their root causes [1,12,14] in log data. Zhong et al [18] proposed an anomaly detection method for both device and network errors with fine log time series feature creation.…”
Section: Related Workmentioning
confidence: 99%
“…One day long data(1,440 min) for finding anomalies and their root causes have been proposed to overcome this problem [1,9,11,12,14,18]. In many cases, one first classifies the logs by their message type (i.e., log template) then treats them as statistical time series to be later processed through statistical analysis.…”
Section: Introductionmentioning
confidence: 99%