Scalable pathogen pipeline platform (SP^ 3): enabling unified genomic data analysis with elastic cloud computing

F Yang-Turner, D Volk, P Fowler… - 2019 IEEE 12th …, 2019 - ieeexplore.ieee.org
F Yang-Turner, D Volk, P Fowler, J Swann, M Bull, S Hoosdally, T Connor, T Peto, D Crook
2019 IEEE 12th International Conference on Cloud Computing (CLOUD), 2019ieeexplore.ieee.org
Pathogen genomic data analysis can be extremely bespoke and diverse. This paper
presents our plan and progress towards creating a Scalable Pathogen Pipeline Platform (SP
3) providing an efficient and unified process of collecting, analysing and comparing genomic
data analysis with the benefit of elastic cloud computing. SP 3 enables container-centric
bioinformatic workflows run on personal computers, High-performance computing (HPC)
clusters and cloud platforms. We have deployed and tested SP 3 on local HPC, Google …
Pathogen genomic data analysis can be extremely bespoke and diverse. This paper presents our plan and progress towards creating a Scalable Pathogen Pipeline Platform (SP 3 ) providing an efficient and unified process of collecting, analysing and comparing genomic data analysis with the benefit of elastic cloud computing. SP 3 enables container-centric bioinformatic workflows run on personal computers, High-performance computing (HPC) clusters and cloud platforms. We have deployed and tested SP 3 on local HPC, Google Cloud Platform (GCP), Microsoft Azure and OpenStack Platforms. SP 3 allows users to fetch genomic sequencing data from European Nucleotide Archive (ENA) and conduct analysis with open-source bioinformatic pipelines. We believe SP 3 will promote common standards around pathogen genomic data quality, data processing and data analysis, helping answer the challenges of tools divergence and leveraging a pool of public genomic data repository and cloud resources.
ieeexplore.ieee.org
以上显示的是最相近的搜索结果。 查看全部搜索结果