Document clustering using synthetic cluster prototypes

A Kalogeratos, A Likas - Data & Knowledge Engineering, 2011 - Elsevier
… as prototypes for clustering text documents with the k-means … searched that partitions the
document into k disjoint clusters. … feature subspace in which each document class can be better …

Extending recommender systems for disjoint user/item sets: The conference recommendation problem

M Hornick, P Tamayo - IEEE Transactions on Knowledge and …, 2012 - ieeexplore.ieee.org
disjoint, but similar, set of items from the set on which actual preferences have been made.
This class of … For example, document-term matrices are used to represent text objects such as …

A segment-based approach to clustering multi-topic documents

A Tagarelli, G Karypis - Knowledge and information systems, 2013 - Springer
… of distinct classes, or topics, that exist in a set of documents \(\… coherent contiguous regions
of text in a document, we adopt … document, we consider algorithms that produce both disjoint

Mining distinction and commonality across multiple domains using generative model for text classification

F Zhuang, P Luo, Z Shen, Q He, Y Xiong… - … on Knowledge and …, 2011 - ieeexplore.ieee.org
document class for classification as a document concept here, document class and document
… [15] tried to fill up those missing values of disjoint features to drive the marginal distribu…

[PDF][PDF] Dimension reduction in text classification with support vector machines.

H Kim, P Howland, H Park, N Christianini - Journal of machine learning …, 2005 - jmlr.org
… To allow an assignment of any document to multiple classes, we introduce the decision rule
disjoint data set, but not for a data set which contains documents belonging multiple classes. …

An integration of WordNet and fuzzy association rule mining for multi-label document clustering

CL Chen, FSC Tseng, T Liang - Data & Knowledge Engineering, 2010 - Elsevier
… With the rapid growth of text documents, document clustering … cluster) and produce a set of
disjoint clusters. Soft clustering … : this document set is a combination of the four classes CACM, …

Collaborative dual-plsa: mining distinction and commonality across multiple domains for text classification

F Zhuang, P Luo, Z Shen, Q He, Y Xiong, Z Shi… - Proceedings of the 19th …, 2010 - dl.acm.org
document class for classification as a document concept here, document class and document
… Since we know the class label of each document in the source domains, we actually know …

An efficient k-means algorithm integrated with Jaccard distance measure for document clustering

R Ferdous - … first asian himalayas international conference on …, 2009 - ieeexplore.ieee.org
… set of documents into disjoint clusters where documents in … Document clustering or
Text categorization is closely related … we found the documents distributed into some class/category …

Learning to classify documents according to genre

A Finn, N Kushmerick - Journal of the American Society for …, 2006 - Wiley Online Library
… We believe that the genre or style of text in a document can … clear boundaries and need not
be disjoint. For a given subject … The first genre class we investigate is whether a document is …

The document components ontology (DoCO)

A Constantin, S Peroni, S Pettifer, D Shotton… - Semantic …, 2016 - content.iospress.com
… to obtain four different disjoint classes describing entities that (A) contain both text and … text
(po:Bucket), (C) contain text but do not contain substructures (po:Flat), (D) do not contain text, …