Top-k Self-Adaptive Contrast Sequential Pattern Mining

Y Wu, Y Wang, Y Li, X Zhu, X Wu - IEEE transactions on …, 2021 - ieeexplore.ieee.org
For sequence classification, an important issue is to find discriminative features, where
sequential pattern mining (SPM) is often used to find frequent patterns from sequences as …

Efficient mining of the most significant patterns with permutation testing

L Pellegrina, F Vandin - Proceedings of the 24th ACM SIGKDD …, 2018 - dl.acm.org
The extraction of patterns displaying significant association with a class label is a key data
mining task with wide application in many domains. We study a variant of the problem that …

SPEck: mining statistically-significant sequential patterns efficiently with exact sampling

S Jenkins, S Walzer-Goldfeld, M Riondato - Data Mining and Knowledge …, 2022 - Springer
We study the problem of efficiently mining statistically-significant sequential patterns from
large datasets, under different null models. We consider one null model presented in the …

Efficient centrality maximization with Rademacher averages

L Pellegrina - Proceedings of the 29th ACM SIGKDD Conference on …, 2023 - dl.acm.org
The identification of the set of k most central nodes of a graph, or centrality maximization, is a
key task in network analysis, with various applications ranging from finding communities in …

Seq2Pat: Sequence‐to‐pattern generation to bridge pattern mining with machine learning

S Kadıoğlu, X Wang, A Hosseininasab… - AI …, 2023 - Wiley Online Library
Pattern mining is an essential part of knowledge discovery and data analytics. It is a
powerful paradigm, especially when combined with constraint reasoning. In this overview …

[HTML][HTML] Dichotomic pattern mining integrated with constraint reasoning for digital behavior analysis

S Ghosh, S Yadav, X Wang, B Chakrabarty… - Frontiers in Artificial …, 2022 - frontiersin.org
Sequential pattern mining remains a challenging task due to the large number of redundant
candidate patterns and the exponential search space. In addition, further analysis is still …

ROhAN: Row-order agnostic null models for statistically-sound knowledge discovery

M Abuissa, A Lee, M Riondato - Data Mining and Knowledge Discovery, 2023 - Springer
We introduce a novel class of null models for the statistical validation of results obtained
from binary transactional and sequence datasets. Our null models are Row-Order Agnostic …

Statistically-sound Knowledge Discovery from Data: Challenges and Directions

M Riondato - 2023 IEEE 5th International Conference on …, 2023 - ieeexplore.ieee.org
We describe Statistically-sound Knowledge Discovery from Data (StatKDD), a
groundbreaking change of paradigm that shifts the focus of the KDD pipeline from the …

Alice  and the Caterpillar: A more descriptive null model for assessing data mining results

G Preti, G de Francisci Morales, M Riondato - Knowledge and Information …, 2024 - Springer
We introduce novel null models for assessing the results obtained from observed binary
transactional and sequence datasets, using statistical hypothesis testing. Our null models …

SILVAN: Estimating Betweenness Centralities with Progressive Sampling and Non-uniform Rademacher Bounds

L Pellegrina, F Vandin - … Transactions on Knowledge Discovery from Data, 2023 - dl.acm.org
“Sim Sala Bim!”—Silvan, https://en. wikipedia. org/wiki/Silvan_ (illusionist) Betweenness
centrality is a popular centrality measure with applications in several domains and whose …