Big data resource management & networks: Taxonomy, survey, and future directions

FM Awaysheh, M Alazab, S Garg… - … Surveys & Tutorials, 2021 - ieeexplore.ieee.org
Big Data (BD) platforms have a long tradition of leveraging trends and technologies from the
broader computer network and communication community. For several years, dedicated …

Priority research directions for in situ data management: Enabling scientific discovery from diverse data sources

T Peterka, D Bard, JC Bennett… - … Journal of High …, 2020 - journals.sagepub.com
In January 2019, the US Department of Energy, Office of Science program in Advanced
Scientific Computing Research, convened a workshop to identify priority research directions …

FlinkCL: An OpenCL-based in-memory computing architecture on heterogeneous CPU-GPU clusters for big data

C Chen, K Li, A Ouyang, K Li - IEEE Transactions on Computers, 2018 - ieeexplore.ieee.org
Research on in-memory big data management and processing has been prompted by the
increase in main memory capacity and the explosion in big data. By offering an efficient in …

Toward high-performance computing and big data analytics convergence: The case of spark-diy

S Caino-Lores, J Carretero, B Nicolae, O Yildiz… - IEEE …, 2019 - ieeexplore.ieee.org
Convergence between high-performance computing (HPC) and big data analytics (BDA) is
currently an established research area that has spawned new opportunities for unifying the …

Spark-diy: A framework for interoperable spark operations with high performance block-based data models

S Caíno-Lores, J Carretero, B Nicolae… - 2018 IEEE/ACM 5th …, 2018 - ieeexplore.ieee.org
Today's scientific applications are increasingly relying on a variety of data sources, storage
facilities, and computing infrastructures, and there is a growing demand for data analysis …

Formal modelling of a robust wireless sensor network routing protocol

K Saghar, W Henderson, D Kendall… - 2010 NASA/ESA …, 2010 - ieeexplore.ieee.org
Because of their low cost, small size, low resources and self-organizing nature a Wireless
Sensor Network (WSN) is a potential solution in hostile environments including military …

Counting kmers for biological sequences at large scale

J Ge, J Meng, N Guo, Y Wei, P Balaji… - Interdisciplinary Sciences …, 2020 - Springer
Counting the abundance of all the distinct kmers in biological sequence data is a
fundamental step in bioinformatics. These applications include de novo genome assembly …

Topo: Towards a Fine-grained Topological Data Processing Framework on Tianhe-3 Supercomputer

N Hu, Y Lu, Z Tang, Z Liu, D Huang, Z Chen - Journal of Parallel and …, 2024 - Elsevier
Big data frameworks are widely deployed in supercomputers for analyzing large-scale
datasets. Topological data processing is an emerging approach that focuses on analyzing …

Bloomfish: a highly scalable distributed k-mer counting framework

T Gao, Y Guo, Y Wei, B Wang, Y Lu… - 2017 IEEE 23rd …, 2017 - ieeexplore.ieee.org
K-mer counting is a fundamental operation in DNA research and genome analytics; its
application includes estimating genome assembly, understanding similarities in genomic …

An alternative C++-based HPC system for Hadoop MapReduce

V Srinivasakumar, M Vanamoorthy, S Sairaj… - Open Computer …, 2022 - degruyter.com
MapReduce (MR) is a technique used to improve distributed data processing vastly and can
massively speed up computation. Hadoop and MR rely on memory-intensive JVM and Java …