[PDF][PDF] Permutation tests for studying classifier performance.

M Ojala, GC Garriga - Journal of machine learning research, 2010 - jmlr.org
We explore the framework of permutation-based p-values for assessing the performance of
classifiers. In this paper we study two simple permutation tests. The first test assess whether …

Knowledge discovery interestingness measures based on unexpectedness

KN Kontonasios, E Spyropoulou… - … reviews: data mining …, 2012 - Wiley Online Library
Abstract Knowledge discovery methods often discover a large number of patterns. Although
this can be considered of interest, it certainly presents considerable challenges too. Indeed …

Brain-computer interface for generating personally attractive images

M Spape, KM Davis, L Kangassalo… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
While we instantaneously recognize a face as attractive, it is much harder to explain what
exactly defines personal attraction. This suggests that attraction depends on implicit …

The wiring economy principle: connectivity determines anatomy in the human brain

A Raj, Y Chen - PloS one, 2011 - journals.plos.org
Minimization of the wiring cost of white matter fibers in the human brain appears to be an
organizational principle. We investigate this aspect in the human brain using whole brain …

The blind men and the elephant: on meeting the problem of multiple truths in data from clustering and pattern mining perspectives

A Zimek, J Vreeken - Machine Learning, 2015 - Springer
In this position paper, we discuss how different branches of research on clustering and
pattern mining, while rather different at first glance, in fact have a lot in common and can …

From black and white to full color: extending redescription mining outside the Boolean world

E Galbrun, P Miettinen - … Analysis and Data Mining: The ASA …, 2012 - Wiley Online Library
Redescription mining is a powerful data analysis tool that is used to find multiple
descriptions of the same entities. Consider geographical regions as an example. They can …

A statistical significance testing approach to mining the most informative set of patterns

J Lijffijt, P Papapetrou, K Puolamäki - Data Mining and Knowledge …, 2014 - Springer
Hypothesis testing using constrained null models can be used to compute the significance of
data mining results given what is already known about the data. We study the novel problem …

Multiple hypothesis testing in pattern discovery

S Hanhijärvi - Discovery Science: 14th International Conference, DS …, 2011 - Springer
The problem of multiple hypothesis testing arises when there are more than one hypothesis
to be tested simultaneously for statistical significance. This is a very common situation in …

Data mining of temporal sequences for the prediction of infrequent failure events: application on floating train data for predictive maintenance

W Sammouri - 2014 - theses.hal.science
In order to meet the mounting social and economic demands, railway operators and
manufacturers are striving for a longer availability and a better reliability of railway …

Evaluating query result significance in databases via randomizations

M Ojala, GC Garriga, A Gionis, H Mannila - Proceedings of the 2010 SIAM …, 2010 - SIAM
Many sorts of structured data are commonly stored in a multi-relational format of interrelated
tables. Under this relational model, exploratory data analysis can be done by using …