ldagibbs: A command for topic modeling in Stata using latent Dirichlet allocation

C Schwarz - The Stata Journal, 2018 - journals.sagepub.com
The Stata Journal, 2018journals.sagepub.com
In this article, I introduce the ldagibbs command, which implements latent Dirichlet allocation
in Stata. Latent Dirichlet allocation is the most popular machine-learning topic model. Topic
models automatically cluster text documents into a user-chosen number of topics. Latent
Dirichlet allocation represents each document as a probability distribution over topics and
represents each topic as a probability distribution over words. Therefore, latent Dirichlet
allocation provides a way to analyze the content of large unclassified text data and an …
In this article, I introduce the ldagibbs command, which implements latent Dirichlet allocation in Stata. Latent Dirichlet allocation is the most popular machine-learning topic model. Topic models automatically cluster text documents into a user-chosen number of topics. Latent Dirichlet allocation represents each document as a probability distribution over topics and represents each topic as a probability distribution over words. Therefore, latent Dirichlet allocation provides a way to analyze the content of large unclassified text data and an alternative to predefined document classifications.
Sage Journals
以上显示的是最相近的搜索结果。 查看全部搜索结果