The sourmash software package uses MinHash-based sketching to create “signatures”, compressed representations of DNA, RNA, and protein sequences, that can be stored …
As computational biologists continue to be inundated by ever increasing amounts of metagenomic data, the need for data analysis approaches that keep up with the pace of …
Considerable advances in genomics over the past decade have resulted in vast amounts of data being generated and deposited in global archives. The growth of these archives …
Alterations in the human microbiome have been observed in a variety of conditions such as asthma, gingivitis, dermatitis and cancer, and much remains to be learned about the links …
Genome search and/or classification typically involves finding the best-match database (reference) genomes and has become increasingly challenging due to the growing number …
As the scale of biological data generation has increased, the bottleneck of research has shifted from data generation to analysis. Researchers commonly need to build …
Mapping metagenome reads to reference databases is the standard approach for assessing microbial taxonomic and functional diversity from metagenomic data. However, public …
The human gut microbiome is an intricate ecosystem with profound implications for host metabolism, immune function, and neuroendocrine activity. Over the years, studies have …
O Ertl - IEEE Transactions on Knowledge and Data …, 2020 - ieeexplore.ieee.org
The probability Jaccard similarity was recently proposed as a natural generalization of the Jaccard similarity to measure the proximity of sets whose elements are associated with …