Ten quick tips for bioinformatics analyses using an Apache Spark distributed computing environment

D Chicco, U Ferraro Petrillo… - PLOS Computational …, 2023 - journals.plos.org
Some scientific studies involve huge amounts of bioinformatics data that cannot be analyzed
on personal computers usually employed by researchers for day-to-day activities but rather …

Framing Apache Spark in life sciences

A Manconi, M Gnocchi, L Milanesi, O Marullo… - Heliyon, 2023 - cell.com
Advances in high-throughput and digital technologies have required the adoption of big data
for handling complex tasks in life sciences. However, the drift to big data led researchers to …

Energy analysis of Internet of things data mining algorithm for smart green communication networks

Z Du - Computer Communications, 2020 - Elsevier
With the continuous development of Internet technology and electronic information
technology, big data technology and cloud computing technology also rise and develop, and …

[PDF][PDF] Novel Dynamic Scaling Algorithm for Energy Efficient Cloud Computing.

MV Kumar, K Venkatachalam, M Masud… - … Automation & Soft …, 2022 - cdn.techscience.cn
Huge data processing applications are stored efficiently using cloud computing platform.
Few technologies like edge computing, Internet of Things (IoT) model helps cloud computing …

Analyzing big datasets of genomic sequences: fast and scalable collection of k-mer statistics

U Ferraro Petrillo, M Sorella, G Cattaneo… - BMC …, 2019 - Springer
Background Distributed approaches based on the MapReduce programming paradigm
have started to be proposed in the Bioinformatics domain, due to the large amount of data …

Prot-SpaM: fast alignment-free phylogeny reconstruction based on whole-proteome sequences

CA Leimeister, J Schellhorn, S Dörrer, M Gerth… - …, 2019 - academic.oup.com
Word-based or 'alignment-free'sequence comparison has become an active research area
in bioinformatics. While previous word-frequency approaches calculated rough measures of …

Read-SpaM: assembly-free and alignment-free comparison of bacterial genomes with low sequencing coverage

AK Lau, S Dörrer, CA Leimeister, C Bleidorn… - BMC …, 2019 - Springer
Background In many fields of biomedical research, it is important to estimate phylogenetic
distances between taxa based on low-coverage sequencing reads. Major applications are …

GPrimer: a fast GPU-based pipeline for primer design for qPCR experiments

J Bae, H Jeon, MS Kim - BMC bioinformatics, 2021 - Springer
Background Design of valid high-quality primers is essential for qPCR experiments.
MRPrimer is a powerful pipeline based on MapReduce that combines both primer design for …

Using software visualization to support the teaching of distributed programming

L Di Rocco, U Ferraro Petrillo, F Palini - The Journal of Supercomputing, 2023 - Springer
In this paper, we introduce MARVEL, a system designed to simplify the teaching of
MapReduce, a popular distributed programming paradigm, through software visualization …

Energy analysis and application of data mining algorithms for internet of things based on hadoop cloud platform

Y Zheng, G Chen - IEEE Access, 2019 - ieeexplore.ieee.org
The paper analyses and studies the classification and characteristics of Internet of Things
(IoT) information, and discusses the construction and application of Hadoop Cloud Platform …