作者
Jimeng Sun, Spiros Papadimitriou, Ching-Yung Lin, Nan Cao, Shixia Liu, Weihong Qian
发表日期
2009/4/30
图书
Proceedings of the 2009 SIAM International Conference on Data Mining
页码范围
1064-1075
出版商
Society for Industrial and Applied Mathematics
简介
With the explosion of social media, scalability becomes a key challenge. There are two main aspects of the problems that arise: 1) data volume: how to manage and analyze huge datasets to efficiently extract patterns, 2) data understanding: how to facilitate understanding of the patterns by users?
To address both aspects of the scalability challenge, we present a hybrid approach that leverages two complementary disciplines, data mining and information visualization. In particular, we propose 1) an analytic data model for content-based networks using tensors; 2) an efficient high-order clustering framework for analyzing the data; 3) a scalable context-sensitive graph visualization to present the clusters.
We evaluate the proposed methods using both synthetic and real datasets. In terms of computational efficiency, the proposed methods are an order of magnitude faster compared to the baseline. In terms of …
引用总数
20092010201120122013201420152016201720182019202020212022202320241885379117109712452
学术搜索中的文章
J Sun, S Papadimitriou, CY Lin, N Cao, S Liu, W Qian - Proceedings of the 2009 SIAM International …, 2009