Petuum: A new platform for distributed machine learning on big data

EP Xing, Q Ho, W Dai, JK Kim, J Wei, S Lee… - Proceedings of the 21th …, 2015 - dl.acm.org
How can one build a distributed framework that allows efficient deployment of a wide
spectrum of modern advanced machine learning (ML) programs for industrial-scale …

[HTML][HTML] Strategies and principles of distributed machine learning on big data

EP Xing, Q Ho, P Xie, D Wei - Engineering, 2016 - Elsevier
The rise of big data has led to new demands for machine learning (ML) systems to learn
complex models, with millions to billions of parameters, that promise adequate capacity to …

[PDF][PDF] MLbase: A Distributed Machine-learning System.

T Kraska, A Talwalkar, JC Duchi, R Griffith, MJ Franklin… - Cidr, 2013 - i.stanford.edu
Machine learning (ML) and statistical techniques are key to transforming big data into
actionable knowledge. In spite of the modern primacy of data, the complexity of existing ML …

On model parallelization and scheduling strategies for distributed machine learning

S Lee, JK Kim, X Zheng, Q Ho… - Advances in neural …, 2014 - proceedings.neurips.cc
Distributed machine learning has typically been approached from a data parallel
perspective, where big data are partitioned to multiple workers and an algorithm is executed …

Machine learning and cloud computing: Survey of distributed and saas solutions

D Pop - arXiv preprint arXiv:1603.08767, 2016 - arxiv.org
Applying popular machine learning algorithms to large amounts of data raised new
challenges for the ML practitioners. Traditional ML libraries does not support well processing …

Bigdl: A distributed deep learning framework for big data

JJ Dai, Y Wang, X Qiu, D Ding, Y Zhang… - Proceedings of the …, 2019 - dl.acm.org
ThispaperpresentsBigDL (adistributeddeeplearning framework for Apache Spark), which
has been used by a variety of users in the industry for building deep learning applications on …

High-performance distributed ML at scale through parameter server consistency models

W Dai, A Kumar, J Wei, Q Ho, G Gibson… - Proceedings of the AAAI …, 2015 - ojs.aaai.org
Abstract As Machine Learning (ML) applications embrace greater data size and model
complexity, practitioners turn to distributed clusters to satisfy the increased computational …

Systemml: Declarative machine learning on spark

M Boehm, MW Dusenberry, D Eriksson… - Proceedings of the …, 2016 - dl.acm.org
The rising need for custom machine learning (ML) algorithms and the growing data sizes
that require the exploitation of distributed, data-parallel frameworks such as MapReduce or …

Map-reduce for machine learning on multicore

CT Chu, S Kim, YA Lin, YY Yu… - Advances in neural …, 2006 - proceedings.neurips.cc
We are at the beginning of the multicore era. Computers will have increasingly many cores
(processors), but there is still no good programming framework for these architectures, and …

Parallel programming paradigms and frameworks in big data era

C Dobre, F Xhafa - International Journal of Parallel Programming, 2014 - Springer
Abstract With Cloud Computing emerging as a promising new approach for ad-hoc parallel
data processing, major companies have started to integrate frameworks for parallel data …