Generalised Brown clustering and roll-up feature generation

L Derczynski, S Chester - Proceedings of the AAAI Conference on …, 2016 - ojs.aaai.org
Brown clustering is an established technique, used in hundreds of computational linguistics
papers each year, to group word types that have similar distributional information. It is …

Distributional lattices as a model for discovering syntactic categories in child-directed speech

H Zhu, A Clark - Journal of Psycholinguistic Research, 2022 - Springer
Distribution information plays an important role in word categorization. In this paper, we
present a novel distributional model, distributional lattices to discover syntactic categories in …

Fine-grained Arabic named entity recognition

FSS Alotaibi - 2015 - etheses.bham.ac.uk
This thesis addresses the problem of fine-grained NER for Arabic, which poses unique
linguistic challenges to NER; such as the absence of capitalisation and short vowels, the …

Empirical studies on word representations

S Suster - 2016 - research.rug.nl
One of the most fundamental tasks in natural language processing is representing words
with mathematical objects (such as vectors). The word representations, which are most often …

Word representations, tree models and syntactic functions

S Šuster, G van Noord, I Titov - arXiv preprint arXiv:1508.07709, 2015 - arxiv.org
Word representations induced from models with discrete latent variables (eg\HMMs) have
been shown to be beneficial in many NLP applications. In this work, we exploit labeled …

Characterizing Word Representations For Natural Language Processing

MR Ciosici - 2019 - pure.au.dk
Over the last decade, Natural Language Processing (NLP) technologies have entered
mainstream usage. Machine Translation, Intelligent Virtual Assistants like Siri and Alexa …

[PDF][PDF] Extending Hidden Markov (tree) models for word representations

S Šuster - BENELEARN 2014 - simonsuster.github.io
There is ample research in natural language processing (NLP) on obtaining word
representations, including vector space modeling, clustering and techniques derived from …