Process-Mining techniques aim to use event data about past executions to gain insight into how processes are executed. While these techniques are proven to be very valuable, they are less successful to reach their goal if the process is flexible and, hence, it exhibits an extremely large number of variants. Furthermore, information systems can record events at very low level, which do not match the high-level concepts known at business level. Without abstracting sequences of events to high-level concepts, the results of applying process mining (to, e.g., discover a model) easily become very complex and difficult to interpret, which ultimately means that they are of little use. A large body of research exists on event abstraction but typically a large amount of domain knowledge is required, which is often not readily available. Other abstraction techniques are unsupervised, which ultimately return less accurate results and/or rely on stronger assumptions. This paper puts forward a technique that requires limited domain knowledge that can be easily provided. Traces are divided in batch sessions, and each session is abstracted as one single high-level activity execution. The abstraction is based on a combination of automatic clustering and visualization methods. The technique was assessed on two case studies about processes characterized by high variability. The results clearly illustrate the benefits of the abstraction to convey accurate knowledge to stakeholders.

Event-log abstraction using batch session identification and clustering

De Leoni M.;
2020

Abstract

Process-Mining techniques aim to use event data about past executions to gain insight into how processes are executed. While these techniques are proven to be very valuable, they are less successful to reach their goal if the process is flexible and, hence, it exhibits an extremely large number of variants. Furthermore, information systems can record events at very low level, which do not match the high-level concepts known at business level. Without abstracting sequences of events to high-level concepts, the results of applying process mining (to, e.g., discover a model) easily become very complex and difficult to interpret, which ultimately means that they are of little use. A large body of research exists on event abstraction but typically a large amount of domain knowledge is required, which is often not readily available. Other abstraction techniques are unsupervised, which ultimately return less accurate results and/or rely on stronger assumptions. This paper puts forward a technique that requires limited domain knowledge that can be easily provided. Traces are divided in batch sessions, and each session is abstracted as one single high-level activity execution. The abstraction is based on a combination of automatic clustering and visualization methods. The technique was assessed on two case studies about processes characterized by high variability. The results clearly illustrate the benefits of the abstraction to convey accurate knowledge to stakeholders.
2020
Proceedings of the ACM Symposium on Applied Computing
35th Annual ACM Symposium on Applied Computing, SAC 2020
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11577/3341305
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 31
  • ???jsp.display-item.citation.isi??? 18
  • OpenAlex ND
social impact