System, method and apparatus for discovering phrases in a database

MW McGreevy - US Patent 6,721,728, 2004 - Google Patents
US6721728B2 - System, method and apparatus for discovering phrases in a database …

Text classification based on multi-word with support vector machine

W Zhang, T Yoshida, X Tang - Knowledge-Based Systems, 2008 - Elsevier
One of the main themes supporting text mining is text representation, i.e., looking for the
appropriate terms to transfer the documents into numerical vectors. Recently, many efforts …

Positional relevance model for pseudo-relevance feedback

Y Lv, CX Zhai - Proceedings of the 33rd international ACM SIGIR …, 2010 - dl.acm.org
Pseudo-relevance feedback is an effective technique for improving retrieval results.
Traditional feedback algorithms use a whole feedback document as a unit to extract words …

Positional language models for information retrieval

Y Lv, CX Zhai - Proceedings of the 32nd international ACM SIGIR …, 2009 - dl.acm.org
Although many variants of language models have been proposed for information retrieval,
there are two related retrieval heuristics remaining "external" to the language modeling …

An exploration of proximity measures in information retrieval

T Tao, CX Zhai - Proceedings of the 30th annual international ACM …, 2007 - dl.acm.org
In most existing retrieval models, documents are scored primarily based on various kinds of
term statistics such as within-document frequencies, inverse document frequencies, and …

Relevance ranking for one to three term queries

CLA Clarke, GV Cormack, EA Tudhope - Information processing & …, 2000 - Elsevier
We investigate the application of a novel relevance ranking technique, cover density
ranking, to the requirements of Web-based information retrieval, where a typical query …

Context and page analysis for improved web search

S Lawrence, CL Giles - IEEE Internet computing, 1998 - ieeexplore.ieee.org
NECI Research Institute has developed a metasearch engine that improves the efficiency of
Web searches by downloading and analyzing each document and then displaying results …

System, method and apparatus for conducting a keyterm search

MW McGreevy - US Patent 6,823,333, 2004 - Google Patents
… subsets of the database that are relevant to an input query. First, a number of relational
models of subsets of a database are provided. A query is then input. The query can include …

Effective ranking with arbitrary passages

M Kaszkiel, J Zobel - Journal of the American Society for …, 2001 - Wiley Online Library
Text retrieval systems store a great variety of documents, from abstracts, newspaper articles,
and Web pages to journal articles, books, court transcripts, and legislation. Collections of …

Modeling term proximity for probabilistic information retrieval models

B He, JX Huang, X Zhou - Information Sciences, 2011 - Elsevier
Proximity among query terms has been found to be useful for improving retrieval
performance. However, its application to classical probabilistic information retrieval models …