2014
DOI: 10.1142/s0218843014400012
|View full text |Cite
|
Sign up to set email alerts
|

Quality Dimensions in Process Discovery: The Importance of Fitness, Precision, Generalization and Simplicity

Abstract: Process discovery algorithms typically aim at discovering process models from event logs that best describe the recorded behavior. Often, the quality of a process discovery algorithm is measured by quantifying to what extent the resulting model can reproduce the behavior in the log, i.e. replay fitness. At the same time, there are other measures that compare a model with recorded behavior in terms of the precision of the model and the extent to which the model generalizes the behavior in the log. Furthermore, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
123
0
2

Year Published

2016
2016
2021
2021

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 156 publications
(143 citation statements)
references
References 44 publications
0
123
0
2
Order By: Relevance
“…Most existing discovery algorithms map each unique event label to one task, making it impossible to discover processes with two tasks with the same label. Some discovery algorithms can refine labels during model construction to some extent [1,2,4,5,9]. However, these internal mechanism to handle duplicated tasks can not be used in other discovery algorithms.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…Most existing discovery algorithms map each unique event label to one task, making it impossible to discover processes with two tasks with the same label. Some discovery algorithms can refine labels during model construction to some extent [1,2,4,5,9]. However, these internal mechanism to handle duplicated tasks can not be used in other discovery algorithms.…”
Section: Related Workmentioning
confidence: 99%
“…Dozens criteria and measures have been proposed for assessing the quality of discovered models, which may be discussed in three categories. Measures that evaluate the quality of the model using the input log often consider fitness, precision, and generalization [3,5]; we use the fitness defined in [16] and precision in [17]. In the context of controlled experiments, the quality of a discovered model can be evaluated against the original system in terms of how much of the behavior of the system can be reproduced by the discovered model and how precise the model describes the system [18].…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Formally: Quality Dimensions. Process mining techniques aim at extracting from a log L a process model N (e.g., a Petri net) with the goal to elicit the process underlying a system S. By relating the behaviors of L, L(N ) and S, particular concepts can be defined [9]. A log is…”
Section: Preliminariesmentioning
confidence: 99%