[PDF][PDF] Deep learning in population genetics

K Korfmann, OE Gaggiotti… - Genome Biology and …, 2023 - academic.oup.com
Population genetics is transitioning into a data-driven discipline thanks to the availability of
large-scale genomic data and the need to study increasingly complex evolutionary …

[HTML][HTML] The visual story of data storage: From storage properties to user interfaces

A Anžel, D Heider, G Hattab - Computational and Structural Biotechnology …, 2021 - Elsevier
About fifty times more data has been created than there are stars in the observable universe.
Current trends in data creation and consumption mean that the devices and storage media …

[HTML][HTML] Enhancing metagenomic classification with compression-based features

JM Silva, JR Almeida - Artificial Intelligence in Medicine, 2024 - Elsevier
Metagenomics is a rapidly expanding field that uses next-generation sequencing technology
to analyze the genetic makeup of environmental samples. However, accurately identifying …

AGC: compact representation of assembled genomes with fast queries and updates

S Deorowicz, A Danek, H Li - Bioinformatics, 2023 - academic.oup.com
Motivation High-quality sequence assembly is the ultimate representation of complete
genetic information of an individual. Several ongoing pangenome projects are producing …

DNACoder: a CNN-LSTM attention-based network for genomic sequence data compression

KS Sheena, MS Nair - Neural Computing and Applications, 2024 - Springer
Genomic sequencing has become increasingly prevalent, generating massive amounts of
data and facing a significant challenge in long-term storage and transmission. A solution that …

[PDF][PDF] Sousa

RCM PEREIRA - PA Fatores de mortalidade de micro e pequenas, 2018 - researchgate.net
The increasing availability of expressive quantities of human viral sequenced samples,
namely from clinical and forensic contexts, has led to the emergence of many optimized …

The complexity landscape of viral genomes

JM Silva, D Pratas, T Caetano, S Matos - GigaScience, 2022 - academic.oup.com
Background Viruses are among the shortest yet highly abundant species that harbor
minimal instructions to infect cells, adapt, multiply, and exist. However, with the current …

Efficient compression of SARS-CoV-2 genome data using Nucleotide Archival Format

K Kryukov, L Jin, S Nakagawa - Patterns, 2022 - cell.com
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) genome data are essential
for epidemiology, vaccine development, and tracking emerging variants. Millions of SARS …

LEC-Codec: Learning-based genome data compression

Z Sun, M Wang, S Wang… - IEEE/ACM Transactions on …, 2024 - ieeexplore.ieee.org
In this paper, we propose a Learning-based gEnome Codec (LEC), which is designed for
high efficiency and enhanced flexibility. The LEC integrates several advanced technologies …

A generative nonparametric Bayesian model for whole genomes

A Amin, EN Weinstein, D Marks - Advances in Neural …, 2021 - proceedings.neurips.cc
Generative probabilistic modeling of biological sequences has widespread existing and
potential use across biology and biomedicine, particularly given advances in high …