Integration of heterogeneous databases without common domains using queries based on textual similarity

WW Cohen - Proceedings of the 1998 ACM SIGMOD international …, 1998 - dl.acm.org
Most databases contain “name constants” like course numbers, personal names, and place
names that correspond to entities in the real world. Previous work in integration of …

Learning object identification rules for information integration

S Tejada, CA Knoblock, S Minton - Information Systems, 2001 - Elsevier
When integrating information from multiple websites, the same data objects can exist in
inconsistent text formats across sites, making it difficult to identify matching objects using …

Approximate data instance matching: a survey

CF Dorneles, R Gonçalves… - … and Information Systems, 2011 - Springer
Approximate data matching is a central problem in several data management processes,
such as data integration, data cleaning, approximate queries, similarity search and so on. An …

Data integration using similarity joins and a word-based information representation language

WW Cohen - ACM Transactions on Information Systems (TOIS), 2000 - dl.acm.org
The integration of distributed, heterogeneous databases, such as those available on the
World Wide Web, poses many problems. Here we consider the problem of integrating data …

A web-based information system that reasons with structured collections of text

WW Cohen - Proceedings of the second international conference on …, 1998 - dl.acm.org
The degree to which information sources are pre-processed by Web-based information
systems varies greatly. In search engines like Altavista, little pre-processing is done, while in …

WHIRL: A word-based information representation language

WW Cohen - Artificial Intelligence, 2000 - Elsevier
We describe WHIRL, an “information representation language” that synergistically combines
properties of logic-based and text-based representation systems. WHIRL is a subset of …

Advanced grouping and aggregation for data integration

E Schallehn, KU Sattler, G Saake - Proceedings of the tenth international …, 2001 - dl.acm.org
New applications from the areas of analytical data processing and data integration require
powerful features to condense and reconcile available data. As outlined in [1], the general …

Extensible and similarity-based grouping for data integration

E Schallehn, KU Sattler, G Saake - PROCEEDINGS OF THE …, 2002 - Citeseer
Data integration as required in a variety of applications like data warehousing, information
system integration etc. makes great demands regarding features to deal with overlapping …

Beyond full-text search: AI-based technology to support the knowledge cycle

DM Steier, SB Huffman, DI Kalish - AAAI Spring Symposium on AI in …, 1997 - cdn.aaai.org
From the mounds of raw information available electronically today, what professionals really
need are targeted, timely nuggets of knowledge that can guide the solution to business …

Reasoning about textual similarity in a Web-based information access system

WW Cohen - Autonomous Agents and Multi-Agent Systems, 1999 - Springer
The degree to which information sources are pre-processed by Web-based information
systems varies greatly. In search engines like Altavista, little pre-processing is done, while in …