Statistical aspects of Wasserstein distances

VM Panaretos, Y Zemel - Annual review of statistics and its …, 2019 - annualreviews.org
Wasserstein distances are metrics on probability distributions inspired by the problem of
optimal mass transportation. Roughly speaking, they measure the minimal effort required to …

Microbiome, metagenomics, and high-dimensional compositional data analysis

H Li - Annual Review of Statistics and Its Application, 2015 - annualreviews.org
The human microbiome is the totality of all microbes in and on the human body, and its
importance in health and disease has been increasingly recognized. High-throughput …

EPA-ng: massively parallel evolutionary placement of genetic sequences

P Barbera, AM Kozlov, L Czech, B Morel… - Systematic …, 2019 - academic.oup.com
Next generation sequencing (NGS) technologies have led to a ubiquity of molecular
sequence data. This data avalanche is particularly challenging in metagenetics, which …

Genesis and Gappa: processing, analyzing and visualizing phylogenetic (placement) data

L Czech, P Barbera, A Stamatakis - Bioinformatics, 2020 - academic.oup.com
We present genesis, a library for working with phylogenetic data, and gappa, an
accompanying command-line tool for conducting typical analyses on such data. The tools …

PhyloSift: phylogenetic analysis of genomes and metagenomes

AE Darling, G Jospin, E Lowe, FA Matsen IV, HM Bik… - PeerJ, 2014 - peerj.com
Like all organisms on the planet, environmental microbes are subject to the forces of
molecular evolution. Metagenomic sequencing provides a means to access the DNA …

Bacterial communities in women with bacterial vaginosis: high resolution phylogenetic analyses reveal relationships of microbiota to clinical criteria

S Srinivasan, NG Hoffman, MT Morgan, FA Matsen… - PLoS ONE, 2012 - journals.plos.org
Background: Bacterial vaginosis (BV) is a common condition that is associated with
numerous adverse health outcomes and is characterized by poorly understood changes in …

Trellis tree-based analysis reveals stromal regulation of patient-derived organoid drug responses

MR Zapatero, A Tong, JW Opzoomer, R O'Sullivan… - Cell, 2023 - cell.com
Patient-derived organoids (PDOs) can model personalized therapy responses; however,
current screening technologies cannot reveal drug response mechanisms or how tumor …

pplacer: linear time maximum-likelihood and Bayesian phylogenetic placement of sequences onto a fixed reference tree

FA Matsen, RB Kodner, EV Armbrust - BMC Bioinformatics, 2010 - Springer
Background: Likelihood-based phylogenetic inference is generally considered to be the most
reliable classification method for unknown sequences. However, traditional likelihood-based …

Inference for empirical Wasserstein distances on finite spaces

M Sommerfeld, A Munk - Journal of the Royal Statistical Society …, 2018 - academic.oup.com
The Wasserstein distance is an attractive tool for data analysis but statistical inference is
hindered by the lack of distributional limits. To overcome this obstacle, for probability …

Microbiome preterm birth DREAM challenge: crowdsourcing machine learning approaches to advance preterm birth research

JL Golob, TT Oskotsky, AS Tang, A Roldan… - Cell Reports …, 2024 - cell.com
Every year, 11% of infants are born preterm with significant health consequences, with the
vaginal microbiome a risk factor for preterm birth. We crowdsource models to predict (1) …