Coding for large-scale distributed machine learning

M Xiao, M Skoglund - Entropy, 2022 - mdpi.com
This article aims to give a comprehensive and rigorous review of the principles and recent
development of coding for large-scale distributed machine learning (DML). With increasing …

Lightweight projective derivative codes for compressed asynchronous gradient descent

PJ Soto, I Ilmer, H Guan, J Li - International Conference on …, 2022 - proceedings.mlr.press
Coded distributed computation has become common practice for performing gradient
descent on large datasets to mitigate stragglers and other faults. This paper proposes a …

Group-wise Verifiable Coded Computing under Byzantine Attacks and Stragglers

S Hong, H Yang, Y Yoon, J Lee - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Distributed computing has emerged as a promising solution for accelerating machine
learning training processes on large-scale datasets by leveraging the parallel processing …

Partial Decode and Compare: An Efficient Verification Scheme for Coded Edge Computing

J Wang, W Jiang, J Zhou, Z Lu, K Lu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
In recent years, Coded Edge Computing (CEC) has been greatly studied as a promising
technology to effectively mitigate the impact of stragglers and provide confidentiality in edge …

A Family of Binary Locally Repairable Codes for Coded Distributed Computing

MF Qharabagh, M Ardakani - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
One of the main bottlenecks in distributed computing systems is the stragglers' problem.
Error correction codes have been proposed to alleviate this problem at the cost of coding …

Flexible distributed matrix multiplication

W Li, Z Chen, Z Wang, SA Jafar… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
The distributed matrix multiplication problem with an unknown number of stragglers is
considered, where the goal is to efficiently and flexibly obtain the product of two massive …

Sequence-Aware Coding for Leveraging Stragglers in Coded Matrix Multiplication

X Fan, P Soto, Y Zou, X Su, J Li - ICC 2023-IEEE International …, 2023 - ieeexplore.ieee.org
Matrix multiplication is a foundational building block in various data-intensive workloads.
With the fast-increasing sizes of workloads, it is common to split the job of matrix …

Energy Efficient Partial Distributed Coded Computing in Edge Computing

Y Li, D Zeng, H Geng, Z Yang - GLOBECOM 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
Edge computing is considered a promising computing paradigm that can mitigate energy
consumption and workload of end devices through task offloading to edge servers. Albeit …

Design and Performance Analysis of Partial Computation Output Schemes for Accelerating Coded Machine Learning

X Xu, X Lin, L Duan - IEEE Transactions on Network Science …, 2022 - ieeexplore.ieee.org
Coded machine learning is a technique to use codes, such as-maximum-distance-separable
(-MDS) codes, to reduce the negative effect of stragglers by requiring out of workers to …

Sequence-aware Coding for Matrix Multiplication with Arbitrary Recoverability

Y Zou, J Li - ICC 2024-IEEE International Conference on …, 2024 - ieeexplore.ieee.org
Matrix multiplication is a crucial operation in many data-intensive workloads. Given the large
size of matrices in today's workloads, it is common to split the computation into tasks …