Variations on language modeling for information retrieval.

W Kraaij - 2004 - research.utwente.nl
Search engine technology builds on theoretical and empirical research results in the area of
information retrieval (IR). This dissertation makes a contribution to the field of language …

Document clustering with universum

D Zhang, J Wang, L Si - Proceedings of the 34th international ACM SIGIR …, 2011 - dl.acm.org
Document clustering is a popular research topic, which aims to partition documents into
groups of similar objects (i.e., clusters), and has been widely used in many applications such …

Correlation Clustering for Crosslingual Link Detection.

J Van Gael, X Zhu - IJCAI, 2007 - pages.cs.wisc.edu
The crosslingual link detection problem calls for identifying news articles in multiple
languages that report on the same news event. This paper presents a novel approach based …

TNO Hierarchical topic detection report at TDT 2004

D Trieschnigg, W Kraaij - Topic Detection and Tracking Workshop …, 2004 - researchgate.net
Hierarchical topic detection is a new task in the TDT 2004 evaluation program, which aims to
organize a collection of unstructured news data in a directed acyclic graph (DAG) structure …

Task based evaluation of exploratory search systems

W Kraaij, W Post - Proc. of SIGIR 2006 Workshop, Evaluation …, 2006 - liacs.leidenuniv.nl
Evaluation of interactive search systems has always been time-consuming and complex,
which probably explains the relatively low level of interest from IR researchers for this type of …

Language models for topic tracking: The importance of score normalization

W Kraaij, M Spitters - Language modeling for information retrieval, 2003 - Springer
Generative unigram language models have proven to be a simple though effective model for
information retrieval tasks. In contrast to ad-hoc retrieval, topic tracking requires that …

Hierarchical topic detection in large digital news archives: exploring a sample based approach

D Trieschnigg, W Kraaij - Journal of Digital Information …, 2005 - research.utwente.nl
Hierarchical topic detection is a new task in the TDT 2004 evaluation program, which aims to
organize a collection of unstructured news data in a directed acyclic graph (DAG) structure …

Multimedia search without visual analysis: the value of linguistic and contextual information

FMG de Jong, T Westerveld… - IEEE Transactions on …, 2007 - ieeexplore.ieee.org
This paper addresses the focus of this special issue by analyzing the potential contribution
of linguistic content and other nonimage aspects to the processing of audiovisual data. It …

Automated speech and audio analysis for semantic access to multimedia

F de Jong, R Ordelman, M Huijbregts - International Conference on …, 2006 - Springer
The deployment and integration of audio processing tools can enhance the semantic
annotation of multimedia content, and as a consequence, improve the effectiveness of …

Identifying event descriptions using co-training with online news summaries

WY Wang, K Thadani, K McKeown - Proceedings of 5th …, 2011 - aclanthology.org
Systems that distill information about events from large corpora generally extract
sentences that are relevant to a short event query. We present a novel co-training strategy …