查看文章

mdpi.com 中的 [HTML]

Data stream clustering techniques, applications, and models: comparative analysis and discussion

作者

Umesh Kokate, Arvind Deshpande, Parikshit Mahalle, Pramod Patil

发表日期

2018/10/17

来源

Big Data and Cognitive Computing

卷号

期号

页码范围

出版商

MDPI

简介

Data growth in today’s world is exponential, many applications generate huge amount of data streams at very high speed such as smart grids, sensor networks, video surveillance, financial systems, medical science data, web click streams, network data, etc. In the case of traditional data mining, the data set is generally static in nature and available many times for processing and analysis. However, data stream mining has to satisfy constraints related to real-time response, bounded and limited memory, single-pass, and concept-drift detection. The main problem is identifying the hidden pattern and knowledge for understanding the context for identifying trends from continuous data streams. In this paper, various data stream methods and algorithms are reviewed and evaluated on standard synthetic data streams and real-life data streams. Density-micro clustering and density-grid-based clustering algorithms are discussed and comparative analysis in terms of various internal and external clustering evaluation methods is performed. It was observed that a single algorithm cannot satisfy all the performance measures. The performance of these data stream clustering algorithms is domain-specific and requires many parameters for density and noise thresholds.

引用总数

被引用次数：68

2019202020212022202320245 10 18 12 10 10

学术搜索中的文章

Data stream clustering techniques, applications, and models: comparative analysis and discussion

U Kokate, A Deshpande, P Mahalle, P Patil - Big Data and Cognitive Computing, 2018

被引用次数：68 相关文章所有 4 个版本