Several organizations maintain textual process descriptions alongside graphical process descriptions to make them usable for all stakeholders. Maintaining textual process descriptions in the presence of continuously changing processes is a labor-intensive task. Therefore, the automatic generation of textual descriptions is desirable. However, the trade-offs between the manual and automatic generation of descriptions are yet to be investigated. To that end, this paper aims to answer two vital questions. How similar are the descriptions generated by the two approaches?What is the impact of using the two types of descriptions on process matching? To answer these specific questions, we have generated textual descriptions of 552 process models using the two approaches. To answer the first question, we have applied six text-matching techniques and established that the descriptions overlap significantly; however, the formulation of sentences is substantially different. For answering the second question, we have used 11 text-matching techniques to evaluate the impact of both descriptions on process matching. Results show (a) the choice of matching technique, and the type of description, have an impact on the matching performance and (b) vector space model (VSM) is the most appropriate matching technique whereas 5 gram is the worst performing technique.
KEYWORDSbusiness process management, empirical study, information retrieval, software engineering, textual process description
INTRODUCTIONBusiness process models play a critical role to comprehensively document the workflow of organizations. 1,2 These models have proven to be a valuable artifact as they enable formalizing the functional requirements that should be supported by software systems. 3,4 Figure 1 shows an example process model expressed in a widely used modeling language, business process model and notation (BPMN). The example model is composed of eight activities, two AND gateways, and two XOR gateways. The activities are depicted by rounded edged rectangles, whereas AND gateways are depicted by a diamond shape having (+) sign inside it. The AND gateways represent the concurrent execution of activities. In contrast, XOR gateways, depicted by a diamond shape with a (×) sign inside it, represents the decision point of subsequent activities.Process models are widely acknowledged as an effective tool for the visualization of business operations. 5 However, recognizing that some stakeholders have difficulties in reading and interpreting process models, organizations maintain textual process descriptions alongside graphical counterparts. 2,6-9 The textual process descriptions use natural language text to outline the specifications of business processes. While such descriptions may not be suitable to represent complex aspects of a process in a precise manner, they can be created, maintained and understood by virtually everyone. 10 Recent studies have established that the comprehension of process specification increases among business professionals and technical exper...