Computational solutions to large-scale data management and analysis

EE Schadt, MD Linderman, J Sorenson, L Lee… - Nature reviews …, 2010 - nature.com
Today we can generate hundreds of gigabases of DNA and RNA sequencing data in a week
for less than US $5,000. The astonishing rate of data generation by these low-cost, high …

Big data, but are we ready?

O Trelles, P Prins, M Snir, RC Jansen - Nature Reviews Genetics, 2011 - nature.com
We welcome the timely Review by Schadt et al.(Computational solutions to large-scale data
management and analysis. Nature Rev. Genet. 11, 647–657 (2010)) 1, which presents cloud …

Cloud computing enabled big multi-omics data analytics

S Koppad, GV Gkoutos… - … and biology insights, 2021 - journals.sagepub.com
High-throughput experiments enable researchers to explore complex multifactorial diseases
through large-scale analysis of omics data. Challenges for such high-dimensional data sets …

[HTML][HTML] 'Big data', Hadoop and cloud computing in genomics

A O'Driscoll, J Daugelaite, RD Sleator - Journal of biomedical informatics, 2013 - Elsevier
Since the completion of the Human Genome project at the turn of the Century, there has
been an unprecedented proliferation of genomic sequence data. A consequence of this is …

Cloud computing for genomic data analysis and collaboration

B Langmead, A Nellore - Nature Reviews Genetics, 2018 - nature.com
Next-generation sequencing has made major strides in the past decade. Studies based on
large sequencing data sets are growing in number, and public archives for raw sequencing …

Cloud and heterogeneous computing solutions exist today for the emerging big data problems in biology

EE Schadt, MD Linderman, J Sorenson, L Lee… - Nature Reviews …, 2011 - nature.com
Figure 1| Applying a MapReduce approach in the cloud to solve embarrassingly
parallelizable problems. To traverse a 1 petabyte (PB) data set, Trelles et al. mistakenly …

Bionimbus: a cloud for managing, analyzing and sharing large genomics datasets

AP Heath, M Greenway, R Powell… - Journal of the …, 2014 - academic.oup.com
Background As large genomics and phenotypic datasets are becoming more common, it is
increasingly difficult for most researchers to access, manage, and analyze them. One …

Cloud computing and the DNA data race

MC Schatz, B Langmead, SL Salzberg - Nature biotechnology, 2010 - nature.com
Cloud computing and the DNA data race | Nature Biotechnology Skip to main content Thank
you for visiting nature.com. You are using a browser version with limited support for CSS. To …

[图书][B] Genomics in the cloud: using Docker, GATK, and WDL in Terra

GA Van der Auwera, BD O'Connor - 2020 - books.google.com
Data in the genomics field is booming. In just a few years, organizations such as the
National Institutes of Health (NIH) will host 50+ petabytes—or over 50 million gigabytes—of …

Data analysis: create a cloud commons

LD Stein, BM Knoppers, P Campbell, G Getz, JO Korbel - Nature, 2015 - nature.com
Data analysis: Create a cloud commons | Nature Skip to main content Thank you for visiting
nature.com. You are using a browser version with limited support for CSS. To obtain the best …