Liu has written a comprehensive text on Web mining, which consists of two parts. The first part covers the data mining and machine learning foundations, where all the essential …
The rapid growth of the Web in the past two decades has made it the largest publicly accessible data source in the world. Web mining aims to discover useful information or …
If you're a developer working with XML, you know there's a lot to know about XML, and the XML space is evolving almost moment by moment. But you don't need to commit every XML …
DDC Reis, PB Golgher, AS Silva… - Proceedings of the 13th …, 2004 - dl.acm.org
The Web poses itself as the largest data repository ever available in the history of humankind. Major efforts have been made in order to provide efficient access to relevant …
Y Zhai, B Liu - IEEE Transactions on Knowledge and Data …, 2006 - ieeexplore.ieee.org
This paper studies the problem of structured data extraction from arbitrary Web pages. The objective of the proposed research is to automatically segment data records in a page …
Information extraction from websites is nowadays a relevant problem, usually performed by software modules called wrappers. A key requirement is that the wrapper generation …
Many Web sites, especially those that dynamically generate HTML pages to display the results of a user's query, present information in the form of list or tables. Current tools that …
Theories of human behavior are an important but largely untapped resource for software engineering research. They facilitate understanding of human developers' needs and …
Y Lu, H He, H Zhao, W Meng… - IEEE transactions on …, 2011 - ieeexplore.ieee.org
An increasing number of databases have become web accessible through HTML form- based search interfaces. The data units returned from the underlying database are usually …