A web of concepts

N Dalvi, R Kumar, B Pang, R Ramakrishnan… - Proceedings of the …, 2009 - dl.acm.org
We make the case for developing a web of concepts by starting with the current view of web
(comprised of hyperlinked pages, or documents, each seen as a bag of words), extracting …

[PDF][PDF] Building linguistic corpora from Wikipedia articles and discussions

E Margaretha, H Lüngen - Journal for Language Technology and …, 2014 - jlcl.org
Wikipedia is a valuable resource, useful as a lingustic corpus or a dataset for many kinds of
research. We built corpora from Wikipedia articles and talk pages in the I5 format, a TEI …

[PDF][PDF] Supersense embeddings: A unified model for supersense interpretation, prediction, and utilization

L Flekova, I Gurevych - Proceedings of the 54th Annual Meeting …, 2016 - aclanthology.org
Coarse-grained semantic categories such as supersenses have proven useful for a range of
downstream tasks such as question answering or machine translation. To date, no effort has …

Ranking very many typed entities on wikipedia

H Zaragoza, H Rode, P Mika, J Atserias… - Proceedings of the …, 2007 - dl.acm.org
We discuss the problem of ranking very many entities of different types. In particular we deal
with a heterogeneous set of types, some being very generic and some very specific. We …

Finding support sentences for entities

R Blanco, H Zaragoza - Proceedings of the 33rd international ACM …, 2010 - dl.acm.org
We study the problem of finding sentences that explain the relationship between a named
entity and an ad-hoc query, which we refer to as entity support sentences. This is an …

Next generation Web search

R Baeza-Yates, P Raghavan - Search Computing. LNCS, 2010 - Springer
In this chapter we provide our personal vision of what could be the next generation of Web
search engines, outlining the main research challenges that derive from it. This vision is …

Learning from partially annotated sequences

ER Fernandes, U Brefeld - Machine Learning and Knowledge Discovery in …, 2011 - Springer
We study sequential prediction models in cases where only fragments of the sequences are
annotated with the ground-truth. The task does not match the standard semi-supervised …

[PDF][PDF] Company-oriented extractive summarization of financial news

K Filippova, M Surdeanu, M Ciaramita… - Proceedings of the …, 2009 - aclanthology.org
The paper presents a multi-document summarization system which builds companyspecific
summaries from a collection of financial news such that the extracted sentences contain …

Why finding entities in Wikipedia is difficult, sometimes

G Demartini, CS Firan, T Iofciu, R Krestel, W Nejdl - Information Retrieval, 2010 - Springer
Entity Retrieval (ER)—in comparison to classical search—aims at finding individual entities
instead of relevant documents. Finding a list of entities requires therefore techniques …

Towards semantic search

R Baeza-Yates, M Ciaramita, P Mika… - Natural Language and …, 2008 - Springer
Semantic search seems to be an elusive and fuzzy target to many researchers. One of the
reasons is that the task lies in between several areas of specialization. In this extended …