N Kushmerick - Artificial intelligence, 2000 - Elsevier
The Internet presents numerous sources of useful information—telephone directories, product catalogs, stock quotes, event listings, etc. Recently, many systems have been built …
D Florescu, A Levy, A Mendelzon - ACM Sigmod Record, 1998 - dl.acm.org
The popularity of the World-Wide Web (WWW) has made it a prime vehicle for disseminating information. The relevance of database concepts to the problems of managing and querying …
A Sahuguet, F Azavant - Data & Knowledge Engineering, 2001 - Elsevier
The Web so far has been incredibly successful at delivering information to human users. So successful actually, that there is now an urgent need to go beyond a browsing human …
R Baumgartner, S I'lesca, G Gottlob… - US Patent 7,581,170, 2009 - Google Patents
A method and a system for information extraction from Web pages formatted with markup languages such as HTML [8]. A method and system for interactively and visually describing …
In this paper we present DEByE (Data Extraction By Example), an approach to extracting data from Web sources, based on a small set of examples specified by the user. The novelty …
The goal of information extraction (IE) is to transform text into a structured format and thereby reducing the information in a document to a tabular structure. Unseen texts are taken as …
The Web has become a major conduit to information repositories of all kinds. Today, more than 80% of information published on the Web is generated by underlying databases …
The Practical Handbook of Internet Computing analyzes a broad array of technologies and concerns related to the Internet, including corporate intranets. Fresh and insightful articles by …
R Cooley - ACM Transactions on Internet Technology (TOIT), 2003 - dl.acm.org
The discipline of Web Usage Mining has grown rapidly in the past few years, despite the crash of the e-commerce boom of the late 1990s. Web Usage Mining is the application of …