Overview of accurate coresets

I Jubran, A Maalouf, D Feldman - … Reviews: Data Mining and …, 2021 - Wiley Online Library
A coreset of an input set is its small summarization, such that solving a problem on the
coreset as its input, provably yields the same result as solving the same problem on the …

Fast and accurate least-mean-squares solvers

A Maalouf, I Jubran, D Feldman - Advances in Neural …, 2019 - proceedings.neurips.cc
Least-mean squares (LMS) solvers such as Linear/Ridge/Lasso-Regression, SVD and
Elastic-Net not only solve fundamental machine learning problems, but are also the building …

Tight sensitivity bounds for smaller coresets

A Maalouf, A Statman, D Feldman - Proceedings of the 26th ACM …, 2020 - dl.acm.org
An ε-coreset to the dimensionality reduction problem for a (possibly very large) matrix
A ∈ ℝ^{n×d} is a small scaled subset of its n rows that approximates their sum of squared distances …

Fast and accurate least-mean-squares solvers for high dimensional data

A Maalouf, I Jubran, D Feldman - IEEE Transactions on Pattern …, 2022 - ieeexplore.ieee.org
Least-mean-squares (LMS) solvers such as Linear/Ridge-Regression and SVD not only
solve fundamental machine learning problems, but are also the building blocks in a variety …

Sketch and validate for big data clustering

PA Traganitis, K Slavakis… - IEEE Journal of Selected …, 2015 - ieeexplore.ieee.org
In response to the need for learning tools tuned to big data analytics, the present paper
introduces a framework for efficient clustering of huge sets of (possibly high-dimensional) …

Fast and accurate least-mean-squares solvers

I Jubran, A Maalouf, D Feldman - 2019 - openreview.net
Least-mean squares (LMS) solvers such as Linear/Ridge/Lasso-Regressions, SVD and
Elastic-Nets not only solve fundamental machine learning problems, but are also the …

Large-scale Clustering using Random Sketching and Validation

P Traganitis - 2015 - search.proquest.com
The advent of high-speed Internet, modern devices and global connectivity has introduced
the world to massive amounts of data, that are being generated, communicated and …