Towards scalability and data skew handling in groupby-joins using mapreduce model

MAH Hassan, M Bamha - Procedia Computer Science, 2015 - Elsevier
For over a decade, MapReduce has become the leading programming model for parallel
and massive processing of large volumes of data. This has been driven by the development …

[PDF][PDF] Scalability and optimisation of groupby-joins in mapreduce

M Bamha, MAH Hassan - Technical report LIFO, Universit´ ed' …, 2015 - researchgate.net
For over a decade, MapReduce has become the leading programming model for parallel
and massive processing of large volumes of data. This has been driven by the development …

Scalability and Optimisation of GroupBy-Joins in MapReduce Scalability and Optimisation of GroupBy-Joins in MapReduce

M Bamha, MAH Hassan - 2015 - hal.science
For over a decade, MapReduce has become the leading programming model for parallel
and massive processing of large volumes of data. This has been driven by the development …

[PDF][PDF] ÉCOLE DOCTORALE SCIENCES ET TECHNOLOGIES

MALH HASSAN - 2009 - researchgate.net
La performance des processeurs et la capacité de stockage ne cessent de s' améliorer.
Cependant, le ralentissement et l'embouteillage au niveau des entrées/sorties sur les …

Parallélisme et équilibrage de charges dans le traitement de la jointure sur des architectures distribuées.

MAH Hassan - 2009 - theses.hal.science
L'émergence des applications de bases de données dans les domaines tels que le data
warehousing, le data mining et l'aide à la décision qui font généralement appel à de très …

[PS][PS] Performance Evaluation of a Parallel Algorithm for” GroupBy-Join” Queries Processing in Distributed Architectures

MAH Hassan, M Bamha - univ-orleans.fr
SQL queries involving join and group-by operations are fairly common in many decision
support applications where the size of the input relations is usually very large, so the …

[PDF][PDF] LIFO, Université d'Orléans

MAH Hassan, M Bamha - Citeseer
SQL queries involving join and group-by operations are fairly common in many decision
support applications where the size of the input relations is usually very large, so the …