The deluge of spurious correlations in big data

CS Calude, G Longo - Foundations of science, 2017 - Springer
Very large databases are a major opportunity for science and data analytics is a remarkable
new field of investigation in computer science. The effectiveness of these tools is used to …

[CITATION][C] A survey on correlation analysis of big data

JY Liang, CJ Feng, P Song - Chinese Journal of Computers, 2016

Comment on "Detecting Novel Associations In Large Data Sets" by Reshef Et Al, Science Dec 16, 2011

N Simon, R Tibshirani - arXiv preprint arXiv:1401.7645, 2014 - arxiv.org
The proposal of Reshef et al. (2011) is an interesting new approach for discovering non-linear
dependencies among pairs of measurements in exploratory data mining. However, it …

KM-MIC: An improved maximum information coefficient based on K-Medoids clustering

Y Zhang, P Shang - Communications in Nonlinear Science and Numerical …, 2022 - Elsevier
In order to measure whether and how things are related, statistical correlation analysis
comes into being. Among them, Pearson coefficients, Spearman and Kendall coefficients …

Care: Finding local linear correlations in high dimensional data

X Zhang, F Pan, W Wang - 2008 IEEE 24th International …, 2008 - ieeexplore.ieee.org
Finding latent patterns in high dimensional data is an important research problem with
numerous applications. Existing approaches can be summarized into 3 categories: feature …

Unbiased multivariate correlation analysis

Y Wang, S Romano, V Nguyen, J Bailey… - Proceedings of the …, 2017 - ojs.aaai.org
Correlation measures are a key element of statistics and machine learning, and essential for
a wide range of data analysis tasks. Most existing correlation measures are for pairwise …

[BOOK][B] Cocoa: Correlation coefficient-aware data augmentation

M Esmailoghli, JA Quiané-Ruiz, Z Abedjan - 2021 - repo.uni-hannover.de
Calculating correlation coefficients is one of the most used measures in data science.
Although linear correlations are fast and easy to calculate, they lack robustness and …

Copula-based high dimensional cross-market dependence modeling

J Xu, W Wei, L Cao - … on Data Science and Advanced Analytics …, 2017 - ieeexplore.ieee.org
Dependence across multiple financial markets, such as stock and foreign exchange rate
markets, is high-dimensional, contains various relationships, and often presents complicated …

Association analysis for visual exploration of multivariate scientific data sets

X Liu, HW Shen - IEEE transactions on visualization and …, 2015 - ieeexplore.ieee.org
The heterogeneity and complexity of multivariate characteristics pose a unique challenge
to visual exploration of multivariate scientific data sets, as it requires investigating the usually …

A fast algorithm for computing distance correlation

A Chaudhuri, W Hu - Computational statistics & data analysis, 2019 - Elsevier
Classical dependence measures such as Pearson correlation, Spearman's ρ, and Kendall's
τ can detect only monotonic or linear dependence. To overcome these limitations, Székely et …