Event log preprocessing for process mining: a review

HM Marin-Castro, E Tello-Leal - Applied Sciences, 2021 - mdpi.com
Process Mining allows organizations to obtain actual business process models from event
logs (discovery), to compare the event log or the resulting process model in the discovery …

Time series data imputation: A survey on deep learning approaches

C Fang, C Wang - arXiv preprint arXiv:2011.11347, 2020 - arxiv.org
Time series are all around in real-world applications. However, unexpected accidents for
example broken sensors or missing of the signals will cause missing values in time series …

Data dependencies extended for variety and veracity: A family tree

S Song, F Gao, R Huang… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Besides the conventional schema-oriented tasks, data dependencies are recently revisited
for data quality applications, such as violation detection, data repairing and record matching …

Quality-informed process mining: A case for standardised data quality annotations

K Goel, SJJ Leemans, N Martin, MT Wynn - ACM Transactions on …, 2022 - dl.acm.org
Real-life event logs, reflecting the actual executions of complex business processes, are
faced with numerous data quality issues. Extensive data sanity checks and pre-processing …

Sampling for big data profiling: A survey

Z Liu, A Zhang - IEEE access, 2020 - ieeexplore.ieee.org
Due to the development of internet technology and computer science, data is exploding at
an exponential rate. Big data brings us new opportunities and challenges. On the one hand …

IoT data quality

S Song, A Zhang - Proceedings of the 29th ACM International …, 2020 - dl.acm.org
Data quality issues have been widely recognized in IoT data, and prevent the downstream
applications. In this tutorial, we review the state-of-the-art techniques for IoT data quality …

Efficiently cleaning structured event logs: a graph repair approach

R Huang, J Wang, S Song, X Lin, X Zhu… - ACM Transactions on …, 2023 - dl.acm.org
Event data are often dirty owing to various recording conventions or simply system errors.
These errors may cause serious damage to real applications, such as inaccurate …

Data quality in process mining

N Martin - Interactive process mining in healthcare, 2021 - Springer
To cope with challenges such as tightening budgets and increased care needs, healthcare
organizations are becoming increasingly aware of the need to understand their processes in …

A survey of approximate quantile computation on large-scale data

Z Chen, A Zhang - IEEE Access, 2020 - ieeexplore.ieee.org
As data volume grows extensively, data profiling helps to extract metadata of large-scale
data. However, one kind of metadata, order statistics, is difficult to be computed because …

Event Log Data Quality Issues and Solutions

D Dakic, D Stefanovic, T Vuckovic, M Zizakov… - Mathematics, 2023 - mdpi.com
Process mining is a discipline that analyzes real event data extracted from information
systems that support a business process to construct as-is process models and detect …