Learning multilingual named entity recognition from Wikipedia

J Nothman, N Ringland, W Radford, T Murphy… - Artificial Intelligence, 2013 - Elsevier
We automatically create enormous, free and multilingual silver-standard training annotations
for named entity recognition (ner) by exploiting the text and structure of Wikipedia. Most ner …

Sentiment analysis of political communication: Combining a dictionary approach with crowdcoding

M Haselmayer, M Jenny - Quality & quantity, 2017 - Springer
Sentiment is important in studies of news values, public opinion, negative campaigning or
political polarization and an explosive expansion of digital textual data and fast progress in …

Social haystack: Dynamic quality assessment of citizen-generated content during emergencies

T Ludwig, C Reuter, V Pipek - ACM Transactions on Computer-Human …, 2015 - dl.acm.org
People all over the world are regularly affected by disasters and emergencies. Besides
official emergency services, ordinary citizens are getting increasingly involved in crisis …

[PDF][PDF] Processing and querying large web corpora with the COW14 architecture

R Schäfer - Proceedings of the 3rd Workshop on Challenges in …, 2015 - ids-pub.bsz-bw.de
In this paper, I present the COW14 tool chain, which comprises a web corpus creation tool
called texrex, wrappers for existing linguistic annotation tools as well as an online query …

POLYGLOT-NER: Massive Multilingual Named Entity Recognition

R Al-Rfou, V Kulkarni, B Perozzi, S Skiena - Proceedings of the 2015 SIAM …, 2015 - SIAM
The increasing diversity of languages used on the web introduces a new level of complexity
to Information Retrieval (IR) systems. We can no longer assume that textual content is written …

Cross-lingual word clusters for direct transfer of linguistic structure

O Täckström, R McDonald, J Uszkoreit - The 2012 conference of the …, 2012 - diva-portal.org
It has been established that incorporating word cluster features derived from large unlabeled
corpora can significantly improve prediction of linguistic structure. While previous work has …

Fine-grained named entity recognition in legal documents

E Leitner, G Rehm, J Moreno-Schneider - International conference on …, 2019 - Springer
This paper describes an approach at Named Entity Recognition (NER) in German language
documents from the legal domain. For this purpose, a dataset consisting of German court …

[HTML][HTML] Robust multilingual named entity recognition with shallow semi-supervised features

R Agerri, G Rigau - Artificial Intelligence, 2016 - Elsevier
We present a multilingual Named Entity Recognition approach based on a robust and
general set of features across languages and datasets. Our system combines shallow local …

[PDF][PDF] NoSta-D Named Entity Annotation for German: Guidelines and Dataset.

D Benikova, C Biemann, M Reznicek - LREC, 2014 - lrec-conf.org
We describe the annotation of a new dataset for German Named Entity Recognition (NER).
The need for this dataset is motivated by licensing issues and consistency issues of existing …

[PDF][PDF] Bootstrapped named entity recognition for product attribute extraction

D Putthividhya, J Hu - Proceedings of the 2011 Conference on …, 2011 - aclanthology.org
We present a named entity recognition (NER) system for extracting product attributes and
values from listing titles. Information extraction from short listing titles present a unique …