[HTML][HTML] One-pass MapReduce-based clustering method for mixed large scale data

MA Ben HajKacem, CEB N'cir, N Essoussi - Journal of Intelligent …, 2019 - Springer
Big data is often characterized by a huge volume and a mixed types of attributes namely,
numeric and categorical. K-prototypes has been fitted into MapReduce framework and …

Ensemble method for multi-view text clustering

M Fraj, MA Ben Hajkacem, N Essoussi - … 4–6, 2019, Proceedings, Part I 11, 2019 - Springer
Textual data frequently occurs as an unlabeled document collection, therefore it is useful to
sort this collection into clusters of related documents. On the other hand, text has different …

STiMR -Means: An Efficient Clustering Method for Big Data

MA Ben HajKacem, CE Ben N′ Cir… - International Journal of …, 2019 - World Scientific
Big Data clustering has become an important challenge in data analysis since several
applications require scalable clustering methods to organize such data into groups of similar …

A parallel text clustering method using Spark and hashing

MA Ben HajKacem, CE Ben N'cir, N Essoussi - Computing, 2021 - Springer
Clustering textual data has become an important task in data analytics since several
applications require to automatically organizing large amounts of textual documents into …

On the use of ensemble method for multi view textual data

M Fraj, MA Ben Hajkacem… - Journal of Information and …, 2020 - Taylor & Francis
Nowadays, trends detection is an important task on social media to determine trends that are
being discussed the most on a social platform. One of the main challenges of this task is the …

PSubCLUS: A Parallel Subspace Clustering Algorithm Based On Spark

X Wen, H Juan - IEEE Access, 2020 - ieeexplore.ieee.org
Clustering is one of the most important unsupervised machine learning tasks. It is widely
used to solve problems of intrusion detection, text analysis, image segmentation etc …

A novel linear time clustering using heuristically improved mrk-medoids based on modified squirrel search algorithm

D Puri, D Gupta - Australian Journal of Electrical and Electronics …, 2024 - Taylor & Francis
The rapid development of different techniques and the data are accumulated with distinctive
properties with high dimensions and huge size. The most essential approach in data mining …

KP-S: a spark-based design of the K-prototypes clustering for big data

MAB HajKacem, CEB N'Cir… - 2017 IEEE/ACS 14th …, 2017 - ieeexplore.ieee.org
Big data is often characterized by a huge volume and a mixed types of attributes namely,
numeric and categorical. K-prototypes is one of the most well-known clustering methods to …

Parallel k-prototypes clustering with high efficiency and accuracy

H Jridi, MA Ben HajKacem, N Essoussi - Big Data Analytics and …, 2020 - Springer
Big data is often characterized by a huge volume and mixed types of data including numeric
and categorical. The k-prototypes is one of the best-known clustering methods for mixed …

BubbleNet: An innovative exploratory search and summarization interface with applicability in health social media

S Mohajeri, HW Samuel, OR Zalane… - … Conference on Digital …, 2016 - ieeexplore.ieee.org
We analyse the application of various interfaces to facilitate exploratory search and
summarization of documents, especially BubbleNet, an innovative interface for summarizing …