Automated discovery of search interfaces on the web

J Cope, N Craswell, D Hawking - The Fourteenth Australasian …, 2003 - microsoft.com
Web search engines work well for finding crawlable pages, but not for finding datasets
hidden behind Web search forms. We describe a novel technique for detecting search forms …

Comparing the performance of collection selection algorithms

AL Powell, JC French - ACM Transactions on Information Systems (TOIS …, 2003 - dl.acm.org
The proliferation of online information resources increases the importance of effective and
efficient information retrieval in a multicollection environment. Multicollection searching is …

[PDF] Web search technology

C Yu, W Meng - The internet encyclopedia, 2003 - Citeseer
The World Wide Web has become the largest information source in recent years
and search engines are indispensable tools for finding needed information from the Web …

Distributed information retrieval: A multi-objective resource selection approach

S Wu, F Crestani - … Journal of Uncertainty, Fuzziness and Knowledge …, 2003 - World Scientific
Information retrieval is becoming increasingly concerned with resource selection and data
fusion for distributed archives. In distributed information retrieval, a user submits a query to a …

Experiments with document archive size detection

S Wu, F Gibb, F Crestani - European Conference on Information Retrieval, 2003 - Springer
The size of a document archive is a very important parameter for resource selection in
distributed information retrieval systems. In this paper, we present a method for automatically …

Evaluating database selection algorithms for distributed search

M Sogrine, A Patel - Proceedings of the 2003 ACM symposium on …, 2003 - dl.acm.org
We investigate algorithms of database selection and evaluate their performance by
modelling a distributed search system. The evaluation is done using a 10 gigabyte …

[PDF] The Cyclades Collection Service

L Candela, D Castelli, P Pagano… - 2003 - iris.cnr.it
This report introduces a digital library service, termed Collection Service, designed to
support the dynamic creation of virtual collections of documents. Collections are created by …

Accessing hidden web documents by metasearching a directory of specialty search engines

JKH Shiu, SCF Chan, KFL Chung - … 2003, Aizu, Japan, September 22-24 …, 2003 - Springer
Many valuable Web documents have not been indexed by general search engines and are
only accessible through specific search interfaces. Metasearching groups of specialty …

[PDF] Discovering and ranking data intensive web services: A source-biased approach

J Caverlee, L Liu, D Rocco - 2003 - cercs.gatech.edu
This paper presents a novel source-biased approach to automatically discover and rank
relevant data intensive web services. It supports a service-centric view of the Web through …

[PDF] Architectural Design of WebScales-A Large-Scale Metasearch Engine

W Meng, C Yu, Z Wu, V Raghavan - academia.edu
It is estimated that there are hundreds of thousands of information sources on the Web,
including both the Surface Web and the Deep Web. Most of these sources have their own …