Methodologies for data quality assessment and improvement

C Batini, C Cappiello, C Francalanci… - ACM computing surveys …, 2009 - dl.acm.org
The literature provides a wide range of techniques to assess and improve the quality of data.
Due to the diversity and complexity of these techniques, research has recently focused on …

A brief survey of web data extraction tools

AHF Laender, BA Ribeiro-Neto, AS Da Silva… - ACM Sigmod …, 2002 - dl.acm.org
In the last few years, several works in the literature have addressed the problem of data
extraction from Web pages. The importance of this problem derives from the fact that, once …

[PDF][PDF] Deep Web 数据集成研究综述

刘伟, 孟小峰, 孟卫一 - 计算机学报, 2007 - c.xml.org.cn
As the rapid development of World Wide Web, there is tremendous information" hiddened" in
Deep Web, and its capacity is increasing rapidly. The information can only be accessed by …

[PDF][PDF] Crawling the hidden web

S Raghavan, H Garcia-Molina - Vldb, 2001 - vldb.org
Current-day crawlers retrieve content only from the publicly indexable Web, ie, the set of
Web pages reachable purely by following hypertext links, ignoring search forms and pages …

Tools and approaches for developing data-intensive web applications: a survey

P Fraternali - ACM Computing Surveys (CSUR), 1999 - dl.acm.org
The exponential growth and capillar diffusion of the Web are nurturing a novel generation of
applications, characterized by a direct business-to-customer relationship. The development …

A survey on region extractors from web documents

HA Sleiman, R Corchuelo - IEEE Transactions on Knowledge …, 2012 - ieeexplore.ieee.org
Extracting information from web documents has become a research area in which new
proposals sprout out year after year. This has motivated several researchers to work on …

[图书][B] Generic model management: concepts and algorithms

S Melnik - 2004 - books.google.com
Many challenging problems in information systems engineering involve the manipulation of
complex metadata artifacts or models, such as database schema, interface specifications, or …

Model-driven development of Web applications: the AutoWeb system

P Fraternali, P Paolini - ACM Transactions on Information Systems (TOIS …, 2000 - dl.acm.org
This paper describes a methodology for the development of WWW applications and a tool
environment specifically tailored for the methodology. The methodology and the …

Varv: Reprogrammable interactive software as a declarative data structure

M Borowski, L Murray, R Bagge, JB Kristensen… - Proceedings of the …, 2022 - dl.acm.org
Most modern applications are immutable and turn-key despite the acknowledged benefits of
empowering users to modify their software. Writing extensible software remains challenging …

[PDF][PDF] Engineering semantic web information systems in hera

R Vdovjak, F Frasincar, GJ Houben, P Barna - J. Web Eng., 2003 - academia.edu
The success of the World Wide Web has caused the concept of information system to
change. Web Information Systems (WIS) use from the Web its paradigm and technologies in …