[PDF][PDF] 大数据相关分析综述

梁吉业, 冯晨娇, 宋鹏 - 计算机学报, 2016 - jiyeliang.net
摘要大数据时代, 相关分析因其具有可以快捷, 高效地发现事物间内在关联的优势而受到广泛的
关注, 并有效地应用于推荐系统, 商业分析, 公共管理, 医疗诊断等领域. 面向非线性 …

A fast algorithm for computing distance correlation

A Chaudhuri, W Hu - Computational statistics & data analysis, 2019 - Elsevier
Classical dependence measures such as Pearson correlation, Spearman's ρ, and Kendall's
τ can detect only monotonic or linear dependence. To overcome these limitations, Székely et …

Universal dependency analysis

HV Nguyen, P Mandros, J Vreeken - Proceedings of the 2016 SIAM …, 2016 - SIAM
Most data is multi-dimensional. Discovering whether any subset of dimensions, or
subspaces, shows dependence is a core task in data mining. To do so, we require a …

Local dependent components

A Klami, S Kaski - Proceedings of the 24th international conference on …, 2007 - dl.acm.org
We introduce a mixture of probabilistic canonical correlation analyzers model for analyzing
local correlations, or more generally mutual statistical dependencies, in cooccurring data …

Using transitivity to increase the accuracy of sample-based Pearson correlation coefficients

T Phillips, C GauthierDickey, R Thurimella - Data Warehousing and …, 2010 - Springer
Pearson product-moment correlation coefficients are a well-practiced quantification of linear
dependence seen across many fields. When calculating a sample-based correlation …

A general framework for association analysis of heterogeneous data

G Li, I Gaynanova - 2018 - projecteuclid.org
Supplementary Material for A General Framework for Association Analysis of
Heterogeneous Data. We provide proofs, technical details of the algorithm, a detailed …

Dependence maps, a dimensionality reduction with dependence distance for high-dimensional data

K Lee, A Gray, H Kim - Data Mining and Knowledge Discovery, 2013 - Springer
We introduce the dependence distance, a new notion of the intrinsic distance between
points, derived as a pointwise extension of statistical dependence measures between …

[HTML][HTML] ennemi: Non-linear correlation detection with mutual information

P Laarne, MA Zaidan, T Nieminen - SoftwareX, 2021 - Elsevier
We present ennemi, a Python package for correlation analysis based on mutual information
(MI). MI is a measure of relationship between variables. Unlike Pearson correlation it is valid …

Detecting associations based on the multi-variable maximum information coefficient

T Gu, J Guo, Z Li, S Mao - IEEE Access, 2021 - ieeexplore.ieee.org
The maximum information coefficient (MIC) is a novel and widely-using measure of
association detection in large datasets. The most outstanding feature of MIC is that it has …

A survey on canonical correlation analysis

X Yang, W Liu, W Liu, D Tao - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
In recent years, the advances in data collection and statistical analysis promotes canonical
correlation analysis (CCA) available for more advanced research. CCA is the main …