作者
Steven N Evans, Frederick A Matsen
发表日期
2012/6
期刊
Journal of the Royal Statistical Society Series B: Statistical Methodology
卷号
74
期号
3
页码范围
569-592
出版商
Oxford University Press
简介
It is now common to survey microbial communities by sequencing nucleic acid material extracted in bulk from a given environment. Comparative methods are needed that indicate the extent to which two communities differ given data sets of this type. UniFrac, which gives a somewhat ad hoc phylogenetics-based distance between two communities, is one of the most commonly used tools for these analyses. We provide a foundation for such methods by establishing that, if we equate a metagenomic sample with its empirical distribution on a reference phylogenetic tree, then the weighted UniFrac distance between two samples is just the classical Kantorovich–Rubinstein, or earth mover’s, distance between the corresponding empirical distributions. We demonstrate that this Kantorovich–Rubinstein distance and extensions incorporating uncertainty in the sample locations can be written as a readily computable …
引用总数
20122013201420152016201720182019202020212022202320245527881522111520227
学术搜索中的文章