Comparing Trace Similarity Metrics Across Logs and Evaluation Measures

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

Standard

Comparing Trace Similarity Metrics Across Logs and Evaluation Measures. / Back, Christoffer Olling; Simonsen, Jakob Grue.

Advanced Information Systems Engineering: 35th International Conference, CAiSE 2023, Zaragoza, Spain, June 12–16, 2023, Proceedings. ed. / Marta Indulska; Iris Reinhartz-Berger; Carlos Cetina; Oscar Pastor. Springer, 2023. p. 226-242 (Lecture Notes in Computer Science, Vol. 13901)).

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

Harvard

Back, CO & Simonsen, JG 2023, Comparing Trace Similarity Metrics Across Logs and Evaluation Measures. in M Indulska, I Reinhartz-Berger, C Cetina & O Pastor (eds), Advanced Information Systems Engineering: 35th International Conference, CAiSE 2023, Zaragoza, Spain, June 12–16, 2023, Proceedings. Springer, Lecture Notes in Computer Science, vol. 13901), pp. 226-242, 35th International Conference on Advanced Information Systems Engineering, CAiSE 2023, Zaragoza, Spain, 12/06/2023. https://doi.org/10.1007/978-3-031-34560-9_14

APA

Back, C. O., & Simonsen, J. G. (2023). Comparing Trace Similarity Metrics Across Logs and Evaluation Measures. In M. Indulska, I. Reinhartz-Berger, C. Cetina, & O. Pastor (Eds.), Advanced Information Systems Engineering: 35th International Conference, CAiSE 2023, Zaragoza, Spain, June 12–16, 2023, Proceedings (pp. 226-242). Springer. Lecture Notes in Computer Science Vol. 13901) https://doi.org/10.1007/978-3-031-34560-9_14

Vancouver

Back CO, Simonsen JG. Comparing Trace Similarity Metrics Across Logs and Evaluation Measures. In Indulska M, Reinhartz-Berger I, Cetina C, Pastor O, editors, Advanced Information Systems Engineering: 35th International Conference, CAiSE 2023, Zaragoza, Spain, June 12–16, 2023, Proceedings. Springer. 2023. p. 226-242. (Lecture Notes in Computer Science, Vol. 13901)). https://doi.org/10.1007/978-3-031-34560-9_14

Author

Back, Christoffer Olling ; Simonsen, Jakob Grue. / Comparing Trace Similarity Metrics Across Logs and Evaluation Measures. Advanced Information Systems Engineering: 35th International Conference, CAiSE 2023, Zaragoza, Spain, June 12–16, 2023, Proceedings. editor / Marta Indulska ; Iris Reinhartz-Berger ; Carlos Cetina ; Oscar Pastor. Springer, 2023. pp. 226-242 (Lecture Notes in Computer Science, Vol. 13901)).

Bibtex

@inproceedings{ae1ad85c96cd4b35b9d4a67ac26a3284,
title = "Comparing Trace Similarity Metrics Across Logs and Evaluation Measures",
abstract = "Trace similarity is a prerequisite for several process mining tasks, e.g. identifying process variants and anomalies. Many similarity metrics have been presented in the literature, but the similarity metric itself is seldom subject to controlled evaluation. Instead, they are usually demonstrated in conjunction with downstream tasks, e.g. process model discovery, and often evaluated qualitatively or with limited comparison. In this paper, we isolate similarity metrics from downstream tasks and compare them wrt. evaluation measures adapted from metric learning and clustering literature. We present a comparison of 18 similarity metrics across 4 evaluation measures and 12 event logs. Friedman and Nemenyi tests for statistical significance show that certain similarity metrics consistently outperform on some evaluation measures, but their mean rank varies across evaluation measures. One similarity metric based on a weighted eventually-follows relation does stand out as consistently outperforming, and the simplest n-gram similarity metrics also perform well. Our results demonstrate that choice of evaluation measures will determine the contours of the metric that are revealed. This study may be harnessed as a baseline for benchmarking future work on trace similarity, and describes tools for quantitative evaluation that we hope will inspire empirical rigor in future work.",
keywords = "Empirical Evaluation, Process Mining, Similarity Metric",
author = "Back, {Christoffer Olling} and Simonsen, {Jakob Grue}",
note = "Supported by Innovation Fund Denmark as part of DIREC initiative; 35th International Conference on Advanced Information Systems Engineering, CAiSE 2023 ; Conference date: 12-06-2023 Through 16-06-2023",
year = "2023",
doi = "10.1007/978-3-031-34560-9_14",
language = "English",
isbn = "978-3-031-34559-3",
series = "Lecture Notes in Computer Science",
publisher = "Springer",
pages = "226--242",
editor = "Marta Indulska and Iris Reinhartz-Berger and Carlos Cetina and Oscar Pastor",
booktitle = "Advanced Information Systems Engineering",
address = "Switzerland",

}

RIS

TY - GEN

T1 - Comparing Trace Similarity Metrics Across Logs and Evaluation Measures

AU - Back, Christoffer Olling

AU - Simonsen, Jakob Grue

N1 - Supported by Innovation Fund Denmark as part of DIREC initiative

PY - 2023

Y1 - 2023

N2 - Trace similarity is a prerequisite for several process mining tasks, e.g. identifying process variants and anomalies. Many similarity metrics have been presented in the literature, but the similarity metric itself is seldom subject to controlled evaluation. Instead, they are usually demonstrated in conjunction with downstream tasks, e.g. process model discovery, and often evaluated qualitatively or with limited comparison. In this paper, we isolate similarity metrics from downstream tasks and compare them wrt. evaluation measures adapted from metric learning and clustering literature. We present a comparison of 18 similarity metrics across 4 evaluation measures and 12 event logs. Friedman and Nemenyi tests for statistical significance show that certain similarity metrics consistently outperform on some evaluation measures, but their mean rank varies across evaluation measures. One similarity metric based on a weighted eventually-follows relation does stand out as consistently outperforming, and the simplest n-gram similarity metrics also perform well. Our results demonstrate that choice of evaluation measures will determine the contours of the metric that are revealed. This study may be harnessed as a baseline for benchmarking future work on trace similarity, and describes tools for quantitative evaluation that we hope will inspire empirical rigor in future work.

AB - Trace similarity is a prerequisite for several process mining tasks, e.g. identifying process variants and anomalies. Many similarity metrics have been presented in the literature, but the similarity metric itself is seldom subject to controlled evaluation. Instead, they are usually demonstrated in conjunction with downstream tasks, e.g. process model discovery, and often evaluated qualitatively or with limited comparison. In this paper, we isolate similarity metrics from downstream tasks and compare them wrt. evaluation measures adapted from metric learning and clustering literature. We present a comparison of 18 similarity metrics across 4 evaluation measures and 12 event logs. Friedman and Nemenyi tests for statistical significance show that certain similarity metrics consistently outperform on some evaluation measures, but their mean rank varies across evaluation measures. One similarity metric based on a weighted eventually-follows relation does stand out as consistently outperforming, and the simplest n-gram similarity metrics also perform well. Our results demonstrate that choice of evaluation measures will determine the contours of the metric that are revealed. This study may be harnessed as a baseline for benchmarking future work on trace similarity, and describes tools for quantitative evaluation that we hope will inspire empirical rigor in future work.

KW - Empirical Evaluation

KW - Process Mining

KW - Similarity Metric

U2 - 10.1007/978-3-031-34560-9_14

DO - 10.1007/978-3-031-34560-9_14

M3 - Article in proceedings

AN - SCOPUS:85164023434

SN - 978-3-031-34559-3

T3 - Lecture Notes in Computer Science

SP - 226

EP - 242

BT - Advanced Information Systems Engineering

A2 - Indulska, Marta

A2 - Reinhartz-Berger, Iris

A2 - Cetina, Carlos

A2 - Pastor, Oscar

PB - Springer

T2 - 35th International Conference on Advanced Information Systems Engineering, CAiSE 2023

Y2 - 12 June 2023 through 16 June 2023

ER -

ID: 387382847