The mutual information: detecting and evaluating dependencies between variables

Authors

Ralf Steuer, Jürgen Kurths, Carsten O Daub, Janko Weise, Joachim Selbig

Publication date

2002/10

Journal

Bioinformatics

Issue

suppl_2

Pages

S231-S240

Publisher

Oxford University Press

Description

Motivation: Clustering co-expressed genes usually requires the definition of ‘distance’ or ‘similarity’ between measured datasets, the most common choices being Pearson correlation or Euclidean distance. With the size of available datasets steadily increasing, it has become feasible to consider other, more general, definitions as well. One alternative, based on information theory, is the mutual information, providing a general measure of dependencies between variables. While the use of mutual information in cluster analysis and visualization of large-scale gene expression data has been suggested previously, the earlier studies did not focus on comparing different algorithms to estimate the mutual information from finite data.

Results: Here we describe and review several approaches to estimate the mutual information from finite datasets. Our findings show that the algorithms used so far may be quite substantially improved upon. In particular when dealing with small datasets, finite sample effects and other sources of potentially misleading results have to be taken into account.

Contact: steuer@agnld.uni-potsdam.de
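
As a rough illustration of what "estimating the mutual information from finite data" can look like, here is a minimal sketch of a naive histogram (plug-in) estimator. It is not taken from the paper and is not claimed to be any of the algorithms the authors compare; the function name `mutual_information_hist`, the bin count, and the synthetic data are all assumptions made for this example.

```python
import numpy as np

def mutual_information_hist(x, y, bins=10):
    """Naive plug-in estimate of I(X;Y) in bits from a 2D histogram.

    Hypothetical helper for illustration only; the bin count is an
    arbitrary assumption and strongly affects the estimate.
    """
    counts, _, _ = np.histogram2d(x, y, bins=bins)
    pxy = counts / counts.sum()            # joint probabilities
    px = pxy.sum(axis=1, keepdims=True)    # marginal of x, shape (bins, 1)
    py = pxy.sum(axis=0, keepdims=True)    # marginal of y, shape (1, bins)
    nz = pxy > 0                           # skip empty cells (0 * log 0 = 0)
    return float(np.sum(pxy[nz] * np.log2(pxy[nz] / (px @ py)[nz])))

# Synthetic data (assumed for the example): a nonlinear dependence
# and an independent pair.
rng = np.random.default_rng(0)
x = rng.normal(size=5000)
y_dep = x**2 + 0.1 * rng.normal(size=5000)
y_ind = rng.normal(size=5000)

mi_dep = mutual_information_hist(x, y_dep)
mi_ind = mutual_information_hist(x, y_ind)
r_dep = np.corrcoef(x, y_dep)[0, 1]        # Pearson correlation, for comparison

print(f"MI (dependent):   {mi_dep:.3f} bits")
print(f"MI (independent): {mi_ind:.3f} bits")
print(f"Pearson r (dependent): {r_dep:.3f}")
```

Note that this plug-in estimate is biased upward on finite samples: even for the independent pair it generally comes out slightly above zero, and the bias grows as the number of bins increases relative to the sample size. This is one simple instance of the finite sample effects the abstract refers to.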

Total citations

Cited by 988

Citations per year: 2003: 5; 2004: 13; 2005: 17; 2006: 22; 2007: 27; 2008: 40; 2009: 35; 2010: 47; 2011: 47; 2012: 46; 2013: 73; 2014: 65; 2015: 68; 2016: 61; 2017: 53; 2018: 78; 2019: 54; 2020: 46; 2021: 44; 2022: 60; 2023: 45; 2024: 38
