作者
Md Rezaul Karim, Audrey Michel, Achille Zappa, Pavel Baranov, Ratnesh Sahay, Dietrich Rebholz-Schuhmann
发表日期
2018/9
来源
Briefings in bioinformatics
卷号
19
期号
5
页码范围
1035-1050
出版商
Oxford University Press
简介
Data workflow systems (DWFSs) enable bioinformatics researchers to combine components for data access and data analytics, and to share the final data analytics approach with their collaborators. Increasingly, such systems have to cope with large-scale data, such as full genomes (about 200 GB each), public fact repositories (about 100 TB of data) and 3D imaging data at even larger scales. As moving the data becomes cumbersome, the DWFS needs to embed its processes into a cloud infrastructure, where the data are already hosted. As the standardized public data play an increasingly important role, the DWFS needs to comply with Semantic Web technologies. This advancement to DWFS would reduce overhead costs and accelerate the progress in bioinformatics research based on large-scale data and public resources, as researchers would require less specialized IT knowledge for the …
引用总数
2017201820192020202120222023202413761431
学术搜索中的文章