State-of-the-art in Parallel Computing with R

M Schmidberger, M Morgan… - Journal of …, 2009 - epub.ub.uni-muenchen.de
R is a mature open-source programming language for statistical computing and graphics.
Many areas of statistical research are experiencing rapid growth in the size of data sets …

Toolkit-based high-performance data mining of large data on MapReduce clusters

D Wegener, M Mock, D Adranale… - 2009 IEEE International …, 2009 - ieeexplore.ieee.org
The enormous growth of data in a variety of applications has increased the need for high
performance data mining based on distributed environments. However, standard data …

Parallelizing the execution of native data mining algorithms for computational biology

G Coro, L Candela, P Pagano… - Concurrency and …, 2015 - Wiley Online Library
Data mining is being increasingly used in biology. Biologists are adopting prototyping
languages, like R and Matlab, to facilitate the application of data mining algorithms to their …

A level-encoded transition signaling protocol for high-throughput asynchronous global communication

PB McGee, MY Agyekum… - 2008 14th IEEE …, 2008 - ieeexplore.ieee.org
A new delay-insensitive data encoding scheme for global communication, level-encoded
transition signaling (LETS), is introduced. LETS is a generalization of level-encoded dual rail …

The technologically integrated oncosimulator: combining multiscale cancer modeling with information technology in the in silico oncology context

G Stamatakos, D Dionysiou, A Lunzer… - IEEE Journal of …, 2013 - ieeexplore.ieee.org
This paper outlines the major components and function of the technologically integrated
oncosimulator developed primarily within the Advancing Clinico Genomic Trials on Cancer …

[PDF][PDF] A hedgehop over a max-margin framework using hedge cues

M Georgescul - Proceedings of the Fourteenth Conference on …, 2010 - aclanthology.org
In this paper, we describe the experimental settings we adopted in the context of the 2010
CoNLL shared task for detecting sentences containing uncertainty. The classification results …

GridR: An R-based tool for scientific data analysis in grid environments

D Wegener, T Sengstag, S Sfakianakis… - Future Generation …, 2009 - Elsevier
In this paper, we describe an analysis tool based on the statistical environment R, GridR,
which allows using the collection of methodologies available as R packages in a grid …

Designing social cognition models for multi-agent systems through simulating primate societies

S Picault, A Collinot - … Conference on Multi Agent Systems (Cat …, 1998 - ieeexplore.ieee.org
In this paper we discuss the advantages of investigating primate societies to build Multi-
Agent Systems, and we present our preliminary results in this context. We first give an …

Routine multiple imputation in statistical databases

S van Buuren, EM van Mulligen… - … Working Conference on …, 1994 - ieeexplore.ieee.org
This paper deals with problems concerning missing data in statistical databases. Multiple
imputation is a statistically sound technique for handling incomplete data. Two problems …

Web-based authoring and secure enactment of bioinformatics workflows

S Sfakianakis, L Koumakis… - 2009 Workshops at …, 2009 - ieeexplore.ieee.org
The recent advances in the field of bioinformatics present a number of challenges in the
secure and efficient management and analysis of biological data resources. Workflow …