<h4>Contribution</h4>
<p>We provide initial algorithms and a framework leading to assisted root cause analysis through a modular architecture including collection, identification, analysis, and presentation steps. Arcalog, our proposed framework, creates pre-structured data from vast heterogeneous datasets automatically, enriches the data with additional information from the CI system, and adds fine-grained default and user-defined labels that support the root cause analysis of failures.</p>
<h4>Background</h4>
<p>Projects spanning hundreds of thousands of lines of code and several thousand daily continuous integration workflows cannot rely on manual prelabeling and qualitative interviews to generate meaningful improvements to broken CI job runs.</p>
<h4>Evaluation</h4>
<p>We evaluated our approach by measuring manual root cause analysis times over several CI jobs. The data we used is publicly available via the <a href="https://kubernetes.io/" target="_blank">Kubernetes</a> and <a href="https://www.redhat.com/en/technologies/cloud-computing/openshift" target="_blank">OpenShift</a> projects, allowing every researcher to continue and reproduce our work.</p>
<h4>Community</h4>
<p>In order to create reproducible workflows and improve debugging together, we have created the open Arcalot community. Join our round table, suggest enhancements, and vote on the next roadmap items!</p>