XML data clustering: An overview

A Algergawy, M Mesiti, R Nayak, G Saake - ACM Computing Surveys …, 2011 - dl.acm.org
In the last few years we have observed a proliferation of approaches for clustering XML
documents and schemas based on their structure and content. The presence of such a huge …

XML schema clustering with semantic and hierarchical similarity measures

R Nayak, W Iryadi - Knowledge-Based Systems, 2007 - Elsevier
With the growing popularity of XML as the data representation language, collections of the
XML data are exploded in numbers. The methods are required to manage and discover the …

Fast and effective clustering of XML data using structural information

R Nayak - Knowledge and Information Systems, 2008 - Springer
This paper presents the incremental clustering algorithm, XML documents Clustering with
Level Similarity (XCLS), that groups the XML documents according to structural similarity. A …

Xml document clustering using common xpath

H Leung, F Chung, SCF Chan… - International Workshop on …, 2005 - ieeexplore.ieee.org
XML is becoming a common way of storing data. The elements and their arrangement in the
document's hierarchy not only describe the document structure but also imply the data's …

A progressive clustering algorithm to group the XML data by structural and semantic similarity

R Nayak, T Tran - International Journal of Pattern Recognition and …, 2007 - World Scientific
Since the emergence in the popularity of XML for data representation and exchange over
the Web, the distribution of XML documents has rapidly increased. It has become a …

XCDSearch: An XML context-driven search engine

K Taha, R Elmasri - IEEE Transactions on Knowledge and Data …, 2010 - ieeexplore.ieee.org
We present in this paper, a context-driven search engine called XCDSearch for answering
XML Keyword-based queries as well as Loosely Structured queries, using a stack-based …

Xcls: A fast and effective clustering algorithm for heterogenous xml documents

R Nayak, S Xu - Pacific-Asia Conference on Knowledge Discovery and …, 2006 - Springer
We present a novel clustering algorithm to group the XML documents by similar structures.
We introduce a Level structure format to represent the XML documents for efficient …

BusSEngine: a business search engine

K Taha, R Elmasri - Knowledge and information systems, 2010 - Springer
With the emergence of World Wide Web, business' databases are increasingly being
queried directly by customers. The customers may not be aware of the underlying data and …

Learning element similarity matrix for semi-structured document analysis

J Yang, WK Cheung, X Chen - Knowledge and Information Systems, 2009 - Springer
Capturing latent structural and semantic properties in semi-structured documents (eg, XML
documents) is crucial for improving the performance of related document analysis tasks …

[引用][C] GML 文档结构聚类算法Clu-GML

苗建新, 吉根林 - 南京大学学报: 自然科学版, 2008