2013 IEEE 33rd International Conference on Distributed Computing Systems 2013
DOI: 10.1109/icdcs.2013.26
|View full text |Cite
|
Sign up to set email alerts
|

FChain: Toward Black-Box Online Fault Localization for Cloud Systems

Abstract: Abstract-Distributed applications running inside cloud systems are prone to performance anomalies due to various reasons such as resource contentions, software bugs, and hardware failures. One big challenge for diagnosing an abnormal distributed application is to pinpoint the faulty components. In this paper, we present a black-box online fault localization system called FChain that can pinpoint faulty components immediately after a performance anomaly is detected. FChain first discovers the onset time of abno… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
43
0

Year Published

2014
2014
2021
2021

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 70 publications
(43 citation statements)
references
References 22 publications
(21 reference statements)
0
43
0
Order By: Relevance
“…Such modifications are generally intrusive with significant runtime overhead. Black-box applications expose detail visibility into the application thus limiting the amount of insights achievable but they be profiled in a non-intrusive manner [60].…”
Section: Data Collectionmentioning
confidence: 99%
See 1 more Smart Citation
“…Such modifications are generally intrusive with significant runtime overhead. Black-box applications expose detail visibility into the application thus limiting the amount of insights achievable but they be profiled in a non-intrusive manner [60].…”
Section: Data Collectionmentioning
confidence: 99%
“…Conversely, observing and understanding in-out tra c of hundreds of black-box components without prior knowledge may yield high false positive detections. [ [45]; [33]; [50]; [82]; [60] …”
Section: Flow and Dependency Analysismentioning
confidence: 99%
“…Tan et al presented a method of using two Bayesian classifiers for context-aware anomaly detection (one for context classification, one for anomaly detection). Aguilera et al proposed a black box performance debugging technique by analyzing message-level traces of system activities to infer causal paths [27]. Different from the work above, our work seeks to find an unsupervised method free from labeling historical data.…”
Section: Related Workmentioning
confidence: 99%
“…Therefore, they are generally not helpful for locating faults that have occurred in a program. Previous research [13] on black-box testing proposed an online fault localization system for finding the faulty component when a program is functioning abnormally. In other words, the proposed method identifies the component causing the failure in a multicomponent system, rather than finding the exact fault location in the program.…”
Section: Testing and Debugging In A Black-box Environmentmentioning
confidence: 99%