Computational solutions for omics data

B Berger, J Peng, M Singh - Nature reviews genetics, 2013 - nature.com
High-throughput experimental technologies are generating increasingly massive and
complex genomic data sets. The sheer enormity and heterogeneity of these data threaten to …

Big data: astronomical or genomical?

ZD Stephens, SY Lee, F Faghri, RH Campbell… - PLoS …, 2015 - journals.plos.org
Genomics is a Big Data science and is going to get much bigger, very soon, but it is not
known whether the needs of genomics will exceed other Big Data domains. Projecting to the …

A comprehensive survey of cryptography key management systems

S Rana, FK Parast, B Kelly, Y Wang, KB Kent - Journal of Information …, 2023 - Elsevier
Cryptographic methods have been extensively employed in various systems to address
security objectives, such as data confidentiality, authentication, and secure communication …

The sequence read archive

R Leinonen, H Sugawara, M Shumway… - Nucleic acids …, 2010 - academic.oup.com
The combination of significantly lower cost and increased speed of sequencing has resulted
in an explosive growth of data submitted into the primary next-generation sequence data …

Finding a roadmap to achieve large neuromorphic hardware systems

J Hasler, B Marr - Frontiers in neuroscience, 2013 - frontiersin.org
Neuromorphic systems are gaining increasing importance in an era where CMOS digital
computing techniques are reaching physical limits. These silicon systems mimic extremely …

Efficient storage of high throughput DNA sequencing data using reference-based compression

MHY Fritz, R Leinonen, G Cochrane… - Genome research, 2011 - genome.cshlp.org
Data storage costs have become an appreciable proportion of total cost in the creation and
analysis of DNA sequence data. Of particular concern is that the rate of increase in DNA …

Compression of next-generation sequencing reads aided by highly efficient de novo assembly

DC Jones, WL Ruzzo, X Peng… - Nucleic acids research, 2012 - academic.oup.com
We present Quip, a lossless compression algorithm for next-generation sequencing data in
the FASTQ and SAM/BAM formats. In addition to implementing reference-based …

Relative Lempel-Ziv compression of genomes for large-scale storage and retrieval

S Kuruppu, SJ Puglisi, J Zobel - International Symposium on String …, 2010 - Springer
Self-indexes–data structures that simultaneously provide fast search of and access to
compressed text–are promising for genomic data but in their usual form are not able to …

Robust relative compression of genomes with random access

S Deorowicz, S Grabowski - Bioinformatics, 2011 - academic.oup.com
Motivation: Storing, transferring and maintaining genomic databases becomes a major
challenge because of the rapid technology progress in DNA sequencing and …

A survey on data compression methods for biological sequences

M Hosseini, D Pratas, AJ Pinho - Information, 2016 - mdpi.com
The ever increasing growth of the production of high-throughput sequencing data poses a
serious challenge to the storage, processing and transmission of these data. As frequently …