Analysis of Language Inspired Trace Representation for Anomaly Detection

Tavares, Gabriel Marques; Barbon Jr., Sylvio

doi:10.1007/978-3-030-55814-7_25

Cited by 14 publications

(13 citation statements)

References 18 publications

“…Trace2vec representations were used to cluster traces into similar groups in [4]. Tavares et al [13] use the same representations to identify anomalous cases.…”

Section: Related Work (mentioning)

confidence: 99%

Case2vec: Advances in Representation Learning for Business Processes

Luettgen, Seeliger, Nolle, et al. 2021

Lecture Notes in Business Information Processing

The execution of a business process is often determined by the surrounding context, e.g., department, product, or other attributes an event provides. Process discovery mainly focuses on the executed activities, although the context of a case may be needed to accurately represent a process instance, e.g., for clustering, prediction, or anomaly detection. Hence, in this paper, we present a representation learning technique (Case2vec) using word embeddings for business process data to better encode process instances. Our work extends Trace2vec and incorporates an additional semantic level by using not only the activity name but also the attributes and thereby incorporating the context. We evaluate our approach in the context of trace clustering. Additionally, we show that Case2vec can be used to abstract events which are semantically similar but syntactically different. We also show that word embeddings allow for interpretability when employing vector space arithmetic.

“…Vector representations of cases are required by many techniques in process mining such as trace clustering [4,11,12], prediction [3], and anomaly detection [10,13]. Trace clustering aims to improve the discovery of process models by grouping similar cases.…”

Section: Introduction (mentioning)

confidence: 99%

“…The schemes (b) and (c) are known as count-based vector space models [Appice and Malerba 2016]. In recent years, representations built through learning models have received the attention of process mining researchers [Koninck et al 2018, Peeperkorn et al 2020, Tavares and Barbon 2020]. Learning models are trained on the event log and encode entities (e.g., an activity or a trace) in a vector space such that entities involved in similar contexts are expected to be positioned closer in the vector space.…”

Section: Vector Space Models (mentioning)

confidence: 99%
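
As a rough illustration of the count-based vector space models mentioned in the statement above, the sketch below encodes each trace as a vector of activity frequencies and compares traces by cosine similarity. The toy event log, the activity names, and the helper names `activity_counts` and `cosine_similarity` are assumptions made for this example; they are not taken from any of the cited papers, and learned embeddings such as Trace2vec would replace the counting step with a trained model.

```python
from collections import Counter
from math import sqrt


def activity_counts(trace, vocabulary):
    """Encode a trace as a vector of activity frequencies (bag of activities)."""
    counts = Counter(trace)
    return [counts[activity] for activity in vocabulary]


def cosine_similarity(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = sqrt(sum(x * x for x in u))
    norm_v = sqrt(sum(y * y for y in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical toy event log: case id -> ordered list of activity names.
log = {
    "case_1": ["register", "check", "approve", "pay"],
    "case_2": ["register", "check", "check", "approve", "pay"],
    "case_3": ["register", "reject"],
}

# Fixed activity order so that every trace vector has the same dimensions.
vocabulary = sorted({activity for trace in log.values() for activity in trace})
vectors = {case: activity_counts(trace, vocabulary) for case, trace in log.items()}

sim_12 = cosine_similarity(vectors["case_1"], vectors["case_2"])
sim_13 = cosine_similarity(vectors["case_1"], vectors["case_3"])
print(vocabulary)
print(vectors)
print(sim_12, sim_13)
```

In this toy log, case_1 and case_2 differ only by a repeated activity and so end up closer to each other than to case_3. A count-based encoding like this ignores the order of activities, which is one reason the cited works also consider learned representations.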

“…Solutions for other problems in process mining using embedding-based representations for traces were studied by Peeperkorn et al [Peeperkorn et al 2020] and Tavares and Barbon [Tavares and Barbon 2020]. The former applied such a representation scheme for conformance checking.…”

Section: Related Work (mentioning)

confidence: 99%

Vector space models for trace clustering: a comparative study

Luna, Lima, Neubauer, et al. 2021

Anais Do XVIII Encontro Nacional De Inteligência Artificial E Computacional (ENIAC 2021)

Process mining explores event logs to offer valuable insights to business process managers. Some types of business processes are hard to mine, including unstructured and knowledge-intensive processes. Trace clustering is therefore usually applied to an event log to break it into sublogs that are more amenable to typical process mining tasks. However, applying clustering algorithms involves decisions, such as how traces are represented, that affect the quality of the results. In this paper, we compare four vector space models for trace clustering, using them with an agglomerative clustering algorithm on synthetic and real-world event logs. Our analyses suggest that the embeddings-based vector space model can properly handle trace clustering in unstructured processes.

“…In the last paper, "Analysis of language inspired trace representation for anomaly detection" [28], the authors develop a comparative study about approaches using vector space modeling for trace profiling. Their comparison can guide the appropriate trace profiling choice for all methods working at the intersection of Process Mining and Machine Learning.…”

Section: Selected Papers (mentioning)

confidence: 99%