查看文章

psu.edu 中的 [PDF]

A comparative analysis of particle swarm optimization and K-means algorithm for text clustering using Nepali Wordnet

作者

Sunita Sarkar, Arindam Roy, BS Purkayastha

发表日期

2014/6

期刊

Int. J. Nat. Lang. Comput.(IJNLC)

卷号

期号

简介

The volume of digitized text documents on the web have been increasing rapidly. As there is huge collection of data on the web there is a need for grouping (clustering) the documents into clusters for speedy information retrieval. Clustering of documents is collection of documents into groups such that the documents within each group are similar to each other and not to documents of other groups. Quality of clustering result depends greatly on the representation of text and the clustering algorithm. This paper presents a comparative analysis of three algorithms namely K-means, Particle swarm Optimization (PSO) and hybrid PSO+ K-means algorithm for clustering of text documents using WordNet. The common way of representing a text document is bag of terms. The bag of terms representation is often unsatisfactory as it does not exploit the semantics. In this paper, texts are represented in terms of synsets corresponding to a word. Bag of terms data representation of text is thus enriched with synonyms from WordNet. K-means, Particle Swarm Optimization (PSO) and hybrid PSO+ K-means algorithms are applied for clustering of text in Nepali language. Experimental evaluation is performed by using intra cluster similarity and inter cluster similarity.

引用总数

被引用次数：24

2015201620172018201920202021202220232 3 2 1 2 3 2 4 4

学术搜索中的文章

A comparative analysis of particle swarm optimization and K-means algorithm for text clustering using Nepali Wordnet

S Sarkar, A Roy, BS Purkayastha - Int. J. Nat. Lang. Comput.(IJNLC), 2014

被引用次数：24 相关文章所有 4 个版本