[PDF][PDF] A simple guide to de novo transcriptome assembly and annotation

V Raghavan, L Kraft, F Mesny… - Briefings in …, 2022 - academic.oup.com
A transcriptome constructed from short-read RNA sequencing (RNA-seq) is an easily
attainable proxy catalog of protein-coding genes when genome assembly is unnecessary …

Metagenome analysis using the Kraken software suite

J Lu, N Rincon, DE Wood, FP Breitwieser… - Nature protocols, 2022 - nature.com
Metagenomic experiments expose the wide range of microscopic organisms in any
microbial environment through high-throughput DNA sequencing. The computational …

Large language models generate functional protein sequences across diverse families

A Madani, B Krause, ER Greene, S Subramanian… - Nature …, 2023 - nature.com
Deep-learning language models have shown promise in various biotechnological
applications, including protein design and engineering. Here we describe ProGen, a …

Clustering predicted structures at the scale of the known protein universe

I Barrio-Hernandez, J Yeo, J Jänes, M Mirdita… - Nature, 2023 - nature.com
Proteins are key to all cellular processes and their structure is important in understanding
their function and evolution. Sequence-based predictions of protein structures have …

Unraveling the functional dark matter through global metagenomics

GA Pavlopoulos, FA Baltoumas, S Liu, O Selvitopi… - Nature, 2023 - nature.com
Metagenomes encode an enormous diversity of proteins, reflecting a multiplicity of functions
and activities,. Exploration of this vast sequence space has been limited to a comparative …

A deep siamese neural network improves metagenome-assembled genomes in microbiome datasets across different environments

S Pan, C Zhu, XM Zhao, LP Coelho - Nature communications, 2022 - nature.com
Metagenomic binning is the step in building metagenome-assembled genomes (MAGs)
when sequences predicted to originate from the same genome are automatically grouped …

GUNC: detection of chimerism and contamination in prokaryotic genomes

A Orakov, A Fullam, LP Coelho, S Khedkar… - Genome biology, 2021 - Springer
Genomes are critical units in microbiology, yet ascertaining quality in prokaryotic genome
assemblies remains a formidable challenge. We present GUNC (the Genome UNClutterer) …

Phage family classification under Caudoviricetes: A review of current tools using the latest ICTV classification framework

Y Zhu, J Shang, C Peng, Y Sun - Frontiers in microbiology, 2022 - frontiersin.org
Bacteriophages, which are viruses infecting bacteria, are the most ubiquitous and diverse
entities in the biosphere. There is accumulating evidence revealing their important roles in …

Contamination detection in genomic data: more is not enough

L Cornet, D Baurain - Genome Biology, 2022 - Springer
The decreasing cost of sequencing and concomitant augmentation of publicly available
genomes have created an acute need for automated software to assess genomic …

Evaluation of taxonomic classification and profiling methods for long-read shotgun metagenomic sequencing datasets

DM Portik, CT Brown, NT Pierce-Ward - BMC bioinformatics, 2022 - Springer
Background Long-read shotgun metagenomic sequencing is gaining in popularity and offers
many advantages over short-read sequencing. The higher information content in long reads …