Building efficient and effective metasearch engines

W Meng, C Yu, KL Liu - ACM Computing Surveys (CSUR), 2002 - dl.acm.org
Frequently a user's information needs are stored in the databases of multiple search
engines. It is inconvenient and inefficient for an ordinary user to invoke multiple search …

Using word clusters to detect similar web documents

J Koberstein, YK Ng - … conference on knowledge science, engineering and …, 2006 - Springer
It is relatively easy to detect exact matches in Web documents; however, detecting similar
content in distinct Web documents with different words and sentence structures is a much …

Towards a highly-scalable and effective metasearch engine

Z Wu, W Meng, C Yu, Z Li - … of the 10th international conference on …, 2001 - dl.acm.org
A metasearch engine is a system that supports unified access to multiple local
search engines. Database selection is one of the main challenges in building a large-scale …

Comparing the performance of collection selection algorithms

AL Powell, JC French - ACM Transactions on Information Systems (TOIS …, 2003 - dl.acm.org
The proliferation of online information resources increases the importance of effective and
efficient information retrieval in a multicollection environment. Multicollection searching is …

Filtering system for providing personalized information in the absence of negative data

J Alspector, A Kolcz - US Patent 7,567,958, 2009 - Google Patents
The huge amount of information available at any one time in the evolving World Wide
information infrastructure, and particularly the volume of information accessible via the …

Discovering the representative of a search engine

KL Liu, A Santoso, C Yu, W Meng - Proceedings of the tenth international …, 2001 - dl.acm.org
Given a large number of search engines on the Internet, it is difficult for a person to
determine which search engines could serve his/her information needs. A common solution …

A methodology to retrieve text documents from multiple databases

C Yu, KL Liu, W Meng, Z Wu… - IEEE Transactions on …, 2002 - ieeexplore.ieee.org
This paper presents a methodology for finding the n most similar documents across multiple
text databases for any given query and for any positive integer n. This methodology consists …

Efficient and effective metasearch for text databases incorporating linkages among documents

C Yu, W Meng, W Wu, KL Liu - Proceedings of the 2001 ACM SIGMOD …, 2001 - dl.acm.org
Linkages among documents have a significant impact on the importance of documents, as it
can be argued that important documents are pointed to by many documents or by other …

Filtering system for providing personalized information in the absence of negative data

J Alspector, A Kolcz - US Patent 8,060,507, 2011 - Google Patents
US PATENT DOCUMENTS: 6,341,277 B1 1/2002 Coden et al.; 6,347,317 B1 2/2002 Singhal;
6,397,211 B1 5/2002 Cooper; 6,430,559 B1 8/2002 Zhai; 6,560,590 B1 5/2003 Shwe et al …

Efficient and effective metasearch for a large number of text databases

C Yu, W Meng, KL Liu, W Wu, N Rishe - Proceedings of the eighth …, 1999 - dl.acm.org
Metasearch engines can be used to facilitate ordinary users for retrieving information from
multiple local sources (text databases). In a metasearch engine, the contents of each local …