Topic modeling using latent Dirichlet allocation: A survey

U Chauhan, A Shah - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
We are not able to deal with a mammoth text corpus without summarizing them into a
relatively small subset. A computational tool is extremely needed to understand such a …

Probabilistic topic modeling in multilingual settings: An overview of its methodology and applications

I Vulić, W De Smet, J Tang, MF Moens - Information Processing & …, 2015 - Elsevier
Probabilistic topic models are unsupervised generative models which model document
content as a two-step generation process, that is, documents are observed as mixtures of …

Applications of topic models

J Boyd-Graber, Y Hu, D Mimno - Foundations and Trends® in …, 2017 - nowpublishers.com
How can a single person understand what's going on in a collection of millions of
documents? This is an increasingly common problem: sifting through an organization's e …

[PDF][PDF] Improving vector space word representations using multilingual correlation

M Faruqui, C Dyer - Proceedings of the 14th Conference of the …, 2014 - aclanthology.org
The distributional hypothesis of Harris (1954), according to which the meaning of words is
evidenced by the contexts they occur in, has motivated several effective techniques for …

Computer-assisted text analysis for comparative politics

C Lucas, RA Nielsen, ME Roberts, BM Stewart… - Political …, 2015 - cambridge.org
Recent advances in research tools for the systematic analysis of textual data are enabling
exciting new research throughout the social sciences. For comparative politics, scholars who …

[PDF][PDF] Bilingual word embeddings for phrase-based machine translation

WY Zou, R Socher, D Cer… - Proceedings of the 2013 …, 2013 - aclanthology.org
We introduce bilingual word embeddings: semantic embeddings associated across two
languages in the context of neural language models. We propose a method to learn …

[PDF][PDF] Deep multilingual correlation for improved word embeddings

A Lu, W Wang, M Bansal, K Gimpel… - Proceedings of the 2015 …, 2015 - aclanthology.org
Word embeddings have been found useful for many NLP tasks, including part-of-speech
tagging, named entity recognition, and parsing. Adding multilingual context when learning …

Multilingual topic models for unaligned text

J Boyd-Graber, D Blei - arXiv preprint arXiv:1205.2657, 2012 - arxiv.org
We develop the multilingual topic model for unaligned text (MuTo), a probabilistic model of
text that is designed to analyze corpora composed of documents in two languages. From …

Question retrieval with high quality answers in community question answering

K Zhang, W Wu, H Wu, Z Li, M Zhou - Proceedings of the 23rd ACM …, 2014 - dl.acm.org
This paper studies the problem of question retrieval in community question answering
(CQA). To bridge lexical gaps in questions, which is regarded as the biggest challenge in …

[PDF][PDF] Holistic sentiment analysis across languages: Multilingual supervised latent Dirichlet allocation

J Boyd-Graber, P Resnik - … of the 2010 Conference on Empirical …, 2010 - aclanthology.org
In this paper, we develop multilingual supervised latent Dirichlet allocation (MLSLDA), a
probabilistic generative model that allows insights gleaned from one language's data to …