2021
DOI: 10.48550/arxiv.2103.01782
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

MicroHECL: High-Efficient Root Cause Localization in Large-Scale Microservice Systems

Abstract: Availability issues of industrial microservice systems (e.g., drop of successfully placed orders and processed transactions) directly affect the running of the business. These issues are usually caused by various types of service anomalies which propagate along service dependencies. Accurate and high-efficient root cause localization is thus a critical challenge for large-scale industrial microservice systems. Existing approaches use service dependency graph based analysis techniques to automatically locate ro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
10
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(10 citation statements)
references
References 18 publications
0
10
0
Order By: Relevance
“…Graph-based Analysis. The approach of building and processing topology graphs for determining the possible root causes of performance anomalies is adopted by MonitorRank [38] and MicroHECL [44]. They indeed both rely on services to produce interaction traces, including the start and end times of service interactions, their source and target services, performance metrics (viz., latency, error count, and throughput), and the unique identifier of the corresponding user request.…”
Section: Topologymentioning
confidence: 99%
See 4 more Smart Citations
“…Graph-based Analysis. The approach of building and processing topology graphs for determining the possible root causes of performance anomalies is adopted by MonitorRank [38] and MicroHECL [44]. They indeed both rely on services to produce interaction traces, including the start and end times of service interactions, their source and target services, performance metrics (viz., latency, error count, and throughput), and the unique identifier of the corresponding user request.…”
Section: Topologymentioning
confidence: 99%
“…This information is then processed to reconstruct service invocation chains for the same user request, which are then combined to build the application topology. They however differ because they consider different time slices for building topologies, and since MonitorRank [38] and MicroHECL [44] exploit topologies to determine the root causes of application-level and servicelevel anomalies, respectively. MonitorRank [38] and MicroHECL [44] also differ in the method applied to enact root cause analysis: MonitorRank [38] performs a random walk on the topology graph, whereas MicroHECL explores it through a breadth-first search (BFS).…”
Section: Topologymentioning
confidence: 99%
See 3 more Smart Citations