Duplicate and redundant workflows can be avoided by encouraging workflow reuse. In this paper, we present how workflow similarity matching approach can be used to further enhance existing workflow modeling tools. Most existing workflow similarity algorithms cater for controlflow oriented types of workflow which are typically associated with business workflows. The increase presence of scientific workflows that are mainly dataflow oriented calls for workflow similarity matching that caters for these types of workflows instead. We demonstrate here how our work of applying a behavioral analysis technique (taking into consideration the causal footprint of the workflow) that has been used for finding similarity in business workflows perform when use for scientific workflows. The distinction of our technique is the use of data provenance within the scientific workflow model where positional information of the workflow activities are taken in consideration in order to find matching workflow models. Preliminary experiments have shown that our proposed solution provides a viable alternative for matching scientific workflows within multiple scenarios. Furthermore, our suggested approach performs better, particularly with the removal and extension types of modification to the original workflow.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.