Information retrieval on the web

M Kobayashi, K Takeda - ACM computing surveys (CSUR), 2000 - dl.acm.org
In this paper we review studies of the growth of the Internet and technologies that are useful
for information search and retrieval on the Web. We present data on the Internet from several …

Heterogeneity for the win: One-shot federated clustering

DK Dennis, T Li, V Smith - International Conference on …, 2021 - proceedings.mlr.press
In this work, we explore the unique challenges—and opportunities—of unsupervised
federated learning (FL). We develop and analyze a one-shot federated clustering scheme …

Iterative big data clustering algorithms: a review

A Mohebi, S Aghabozorgi, T Ying Wah… - Software: Practice …, 2016 - Wiley Online Library
Enterprises today are dealing with the massive size of data, which have been explosively
increasing. The key requirements to address this challenge are to extract, analyze, and …

Distributed -Means Algorithm and Fuzzy -Means Algorithm for Sensor Networks Based on Multiagent Consensus Theory

J Qin, W Fu, H Gao, WX Zheng - IEEE transactions on …, 2016 - ieeexplore.ieee.org
This paper is concerned with developing a distributed k-means algorithm and a distributed
fuzzy c-means algorithm for wireless sensor networks (WSNs) where each node is equipped …

Scalable k-means++

B Bahmani, B Moseley, A Vattani, R Kumar… - arXiv preprint arXiv …, 2012 - arxiv.org
Over half a century old and showing no signs of aging, k-means remains one of the most
popular data processing algorithms. As is well-known, a proper initialization of k-means is …

Data clustering: 50 years beyond K-means

AK Jain - Pattern recognition letters, 2010 - Elsevier
Organizing data into sensible groupings is one of the most fundamental modes of
understanding and learning. As an example, a common scheme of scientific classification …

A survey of clustering data mining techniques

P Berkhin - Grouping multidimensional data: Recent advances in …, 2006 - Springer
Clustering is the division of data into groups of similar objects. In clustering, some details are
disregarded in exchange for data simplification. Clustering can be viewed as a data …

A simulation-driven methodology for IoT data mining based on edge computing

C Savaglio, G Fortino - ACM Transactions on Internet Technology (TOIT), 2021 - dl.acm.org
With the ever-increasing diffusion of smart devices and Internet of Things (IoT) applications,
a completely new set of challenges have been added to the Data Mining domain. Edge …

Concept decompositions for large sparse text data using clustering

IS Dhillon, DS Modha - Machine learning, 2001 - Springer
Unlabeled document collections are becoming increasingly common and available; mining
such data sets represents a major contemporary challenge. Using words as features, text …

Parallel spectral clustering in distributed systems

WY Chen, Y Song, H Bai, CJ Lin… - IEEE transactions on …, 2010 - ieeexplore.ieee.org
Spectral clustering algorithms have been shown to be more effective in finding clusters than
some traditional algorithms, such as k-means. However, spectral clustering suffers from a …