[图书][B] Web data mining: exploring hyperlinks, contents, and usage data

B Liu - 2011 - Springer
Liu has written a comprehensive text on Web mining, which consists of two parts. The first
part covers the data mining and machine learning foundations, where all the essential …

Information extraction

S Sarawagi - Foundations and Trends® in Databases, 2008 - nowpublishers.com
The automatic extraction of information from unstructured sources has opened up new
avenues for querying, organizing, and analyzing data by drawing upon the clean semantics …

Data-Centric Systems and Applications

MJ Carey, S Ceri, P Bernstein, U Dayal, C Faloutsos… - Italy: Springer, 2006 - Springer
The rapid growth of the Web in the past two decades has made it the largest publicly
accessible data source in the world. Web mining aims to discover useful information or …

Web data extraction based on partial tree alignment

Y Zhai, B Liu - Proceedings of the 14th international conference on …, 2005 - dl.acm.org
This paper studies the problem of extracting data from a Web page that contains several
structured data records. The objective is to segment these data records, extract data …

Automated detection of refactorings in evolving components

D Dig, C Comertoglu, D Marinov, R Johnson - ECOOP 2006–Object …, 2006 - Springer
One of the costs of reusing software components is updating applications to use the new
version of the components. Updating an application can be error-prone, tedious, and …

Bridging text visualization and mining: A task-driven survey

S Liu, X Wang, C Collins, W Dou… - IEEE transactions on …, 2018 - ieeexplore.ieee.org
Visual text analytics has recently emerged as one of the most prominent topics in both
academic research and the commercial world. To provide an overview of the relevant …

Privacy-preserving detection of sensitive data exposure

X Shu, D Yao, E Bertino - IEEE transactions on information …, 2015 - ieeexplore.ieee.org
Statistics from security firms, research institutions and government organizations show that
the number of data-leak instances have grown rapidly in recent years. Among various data …

The web changes everything: understanding the dynamics of web content

E Adar, J Teevan, ST Dumais, JL Elsas - Proceedings of the Second …, 2009 - dl.acm.org
The Web is a dynamic, ever changing collection of information. This paper explores changes
in Web content by analyzing a crawl of 55,000 Web pages, selected to represent different …

Special issue on web content mining

B Liu, K Chen-Chuan-Chang - Acm Sigkdd explorations newsletter, 2004 - dl.acm.org
With the phenomenal growth of the Web, there is an everincreasing volume of data and
information published in numerous Web pages. The research in Web mining aims to …

Automatic identification of informative sections of web pages

S Debnath, P Mitra, N Pal… - IEEE transactions on …, 2005 - ieeexplore.ieee.org
Web pages-especially dynamically generated ones-contain several items that cannot be
classified as the" primary content," eg, navigation sidebars, advertisements, copyright …