Structured databases on the web: Observations and implications

KCC Chang, B He, C Li, M Patel, Z Zhang - Acm Sigmod Record, 2004 - dl.acm.org
The Web has been rapidly" deepened" by the prevalence of databases online. With the
potentially unlimited information hidden behind their query interfaces, this" deep Web" of …

Wrapper induction: Efficiency and expressiveness

N Kushmerick - Artificial intelligence, 2000 - Elsevier
The Internet presents numerous sources of useful information—telephone directories,
product catalogs, stock quotes, event listings, etc. Recently, many systems have been built …

Database techniques for the World-Wide Web: A survey

D Florescu, A Levy, A Mendelzon - ACM Sigmod Record, 1998 - dl.acm.org
The popularity of the World-Wide Web (WWW) has made it a prime vehicle for disseminating
information. The relevance of database concepts to the problems of managing and querying …

Building intelligent web applications using lightweight wrappers

A Sahuguet, F Azavant - Data & Knowledge Engineering, 2001 - Elsevier
The Web so far has been incredibly successful at delivering information to human users. So
successful actually, that there is now an urgent need to go beyond a browsing human …

Visual and interactive wrapper generation, automated information extraction from Web pages, and translation into XML

R Baumgartner, S I'lesca, G Gottlob… - US Patent 7,581,170, 2009 - Google Patents
A method and a system for information extraction from Web pages formatted with markup
languages such as HTML [8]. A method and system for interactively and visually describing …

DEByE–data extraction by example

AHF Laender, B Ribeiro-Neto, AS Da Silva - Data & Knowledge …, 2002 - Elsevier
In this paper we present DEByE (Data Extraction By Example), an approach to extracting
data from Web sources, based on a small set of examples specified by the user. The novelty …

[PDF][PDF] Information extraction from world wide web-a survey

L Eikvil - 1999 - nr.no
The goal of information extraction (IE) is to transform text into a structured format and thereby
reducing the information in a document to a tabular structure. Unseen texts are taken as …

[PDF][PDF] Building light-weight wrappers for legacy Web data-sources using W4F

A Sahuguet, F Azavant - Vldb, 1999 - Citeseer
The Web has become a major conduit to information repositories of all kinds. Today, more
than 80% of information published on the Web is generated by underlying databases …

[图书][B] The practical handbook of internet computing

MP Singh - 2004 - taylorfrancis.com
The Practical Handbook of Internet Computing analyzes a broad array of technologies and
concerns related to the Internet, including corporate intranets. Fresh and insightful articles by …

The use of web structure and content to identify subjectively interesting web usage patterns

R Cooley - ACM Transactions on Internet Technology (TOIT), 2003 - dl.acm.org
The discipline of Web Usage Mining has grown rapidly in the past few years, despite the
crash of the e-commerce boom of the late 1990s. Web Usage Mining is the application of …