Data mining: opportunities and challenges

J Wang - 2003 - books.google.com
" An overview of the multidisciplinary field of data mining, this book focuses specifically on
new methodologies and case studies. Included are case studies written by 44 leading …

Web mining: Information and pattern discovery on the world wide web

R Cooley, B Mobasher… - Proceedings ninth IEEE …, 1997 - ieeexplore.ieee.org
Application of data mining techniques to the World Wide Web, referred to as Web mining,
has been the focus of several recent research projects and papers. However, there is no …

Integration of heterogeneous databases without common domains using queries based on textual similarity

WW Cohen - Proceedings of the 1998 ACM SIGMOD international …, 1998 - dl.acm.org
Most databases contain “name constants” like course numbers, personal names, and place
names that correspond to entities in the real world. Previous work in integration of …

[PDF][PDF] A hierarchical approach to wrapper induction

I Muslea, S Minton, C Knoblock - … of the third annual conference on …, 1999 - dl.acm.org
With the tremendous amount of information that becomes available on the Web on a daily
basis, the ability to quickly develop information agents has become a crucial problem. A vital …

WebOQL: Restructuring documents, databases, and webs

GO Arocena, AO Mendelzon - Theory and Practice of Object …, 1999 - Wiley Online Library
The widespread use of the Web has originated several new data management problems,
such as extracting data from Web pages and making databases accessible from Web …

Hierarchical wrapper induction for semistructured information sources

I Muslea, S Minton, CA Knoblock - Autonomous Agents and Multi-Agent …, 2001 - Springer
With the tremendous amount of information that becomes available on the Web on a daily
basis, the ability to quickly develop information agents has become a crucial problem. A vital …

Data integration using similarity joins and a word-based information representation language

WW Cohen - ACM Transactions on Information Systems (TOIS), 2000 - dl.acm.org
The integration of distributed, heterogeneous databases, such as those available on the
World Wide Web, poses many problems. Herer we consider the problem of integrating data …

[PDF][PDF] The Web as a resource for question answering: Perspectives and challenges

J Lin - In Proceedings of the Third International Conference …, 2002 - Citeseer
The vast amounts of information readily available on the World Wide Web can be effectively
used for question answering in two fundamentally different ways. In the federated approach …

Semistructured data: The TSIMMIS experience

J Hammer, J McHugh… - proceedings of the first …, 1997 - scienceopen.com
In this paper we discuss themanagement of semi-structured data, ie, data that has irregular
or dynamically changing structure. We describe components of the Stanford TSIMMIS …

Omnibase: Uniform access to heterogeneous data for question answering

B Katz, S Felshin, D Yuret, A Ibrahim, J Lin… - … on Application of …, 2002 - Springer
Abstract Although the World Wide Web contains a tremendous amount of information, the
lack of uniform structure makes finding the right knowledge difficult. A solution is to turn the …