Statistical language models for information retrieval: a critical review

CX Zhai - Foundations and Trends® in Information Retrieval, 2008 - nowpublishers.com
Statistical language models have recently been successfully applied to many information
retrieval problems. A great deal of recent work has shown that statistical language models …

[BOOK][B] Introduction to information retrieval

CD Manning - 2008 - diglib.globalcollege.edu.et
Introduction to Information Retrieval is the first textbook with a coherent treatment of classical
and web information retrieval, including web search and the related areas of text …

A probabilistic model for online document clustering with application to novelty detection

J Zhang, Z Ghahramani, Y Yang - Advances in neural …, 2004 - proceedings.neurips.cc
In this paper we propose a probabilistic model for online document clustering. We use
non-parametric Dirichlet process prior to model the growing number of clusters, and use a prior …

[BOOK][B] A generative theory of relevance

V Lavrenko, WB Croft - 2009 - Springer
A modern information retrieval system must have the capability to find, organize and present
very different manifestations of information–such as text, pictures, videos or database …

TF-IDF uncovered: a study of theories and probabilities

T Roelleke, J Wang - Proceedings of the 31st annual international ACM …, 2008 - dl.acm.org
Interpretations of TF-IDF are based on binary independence retrieval, Poisson, information
theory, and language modelling. This paper contributes a review of existing interpretations …

Nonparametric Bayesian image segmentation

P Orbanz, JM Buhmann - International Journal of Computer Vision, 2008 - Springer
Image segmentation algorithms partition the set of pixels of an image into a specific number
of different, spatially homogeneous groups. We propose a nonparametric Bayesian model …

Corpus structure, language models, and ad hoc information retrieval

O Kurland, L Lee - Proceedings of the 27th annual international ACM …, 2004 - dl.acm.org
Most previous work on the recently developed language-modeling approach to information
retrieval focuses on document-specific characteristics, and therefore does not take into …

[PDF][PDF] An information-theoretic approach to automatic evaluation of summaries

CY Lin, G Cao, J Gao, JY Nie - Proceedings of the Human …, 2006 - aclanthology.org
Until recently there are no common, convenient, and repeatable evaluation methods that
could be easily applied to support fast turn-around development of automatic text …

Probabilistic relevance ranking for collaborative filtering

J Wang, S Robertson, AP de Vries, MJT Reinders - Information Retrieval, 2008 - Springer
Collaborative filtering is concerned with making recommendations about items to users.
Most formulations of the problem are specifically designed for predicting user ratings …

Topic based language models for ad hoc information retrieval

L Azzopardi, M Girolami… - 2004 IEEE International …, 2004 - ieeexplore.ieee.org
We propose a topic based approach to language modelling for ad-hoc information retrieval
(IR). Many smoothed estimators used for the multinomial query model in IR rely upon the …