Topic modeling using latent Dirichlet allocation: A survey

U Chauhan, A Shah - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
We are not able to deal with a mammoth text corpus without summarizing them into a
relatively small subset. A computational tool is extremely needed to understand such a …

MetaLDA: A topic model that efficiently incorporates meta information

H Zhao, L Du, W Buntine, G Liu - 2017 IEEE international …, 2017 - ieeexplore.ieee.org
Besides the text content, documents and their associated words usually come with rich sets
of meta information, such as categories of documents and semantic/syntactic features of …

Discovering online shopping preference structures in large and frequently changing store assortments

M Kim, J Zhang - Journal of Marketing Research, 2023 - journals.sagepub.com
The authors develop an attribute-based mixed-membership model of consumers' preference
for stockkeeping units in store assortments. The model represents the underlying “topics of …

Homogeneity-based transmissive process to model true and false news in social networks

J Kim, D Kim, A Oh - Proceedings of the Twelfth ACM International …, 2019 - dl.acm.org
An overwhelming number of true and false news stories are posted and shared in social
networks, and users diffuse the stories based on multiple factors. Diffusion of news stories …

Modeling topics in DFA-based lemmatized Gujarati text

U Chauhan, S Shah, D Shiroya, D Solanki, Z Patel… - Sensors, 2023 - mdpi.com
Topic modeling is a machine learning algorithm based on statistics that follows
unsupervised machine learning techniques for mapping a high-dimensional corpus to a low …

Leveraging external information in topic modelling

H Zhao, L Du, W Buntine, G Liu - Knowledge and Information Systems, 2019 - Springer
Besides the text content, documents usually come with rich sets of meta-information, such as
categories of documents and semantic/syntactic features of words, like those encoded in …

Is p-value 0.05 enough? A study on the statistical evaluation of classifiers

NM Neumann, A Plastino, JAP Junior… - The Knowledge …, 2020 - cambridge.org
Statistical significance analysis, based on hypothesis tests, is a common approach for
comparing classifiers. However, many studies oversimplify this analysis by simply checking …

Covariate-dependent hierarchical Dirichlet process

H Zhang, S Wade, N Bochkina - arXiv preprint arXiv:2407.02676, 2024 - arxiv.org
The intricacies inherent in contemporary real datasets demand more advanced statistical
models to effectively address complex challenges. In this article we delve into problems …

A Bayesian nonparametric topic model for microbiome data using subject attributes

T Okui - IPSJ Transactions on Bioinformatics, 2020 - jstage.jst.go.jp
Microbiome data have been obtained relatively easily in recent years, and currently, various
methods for analyzing microbiome data are being proposed. Latent Dirichlet allocation …

Dynamic content based ranking

S Virtanen, M Girolami - International Conference on …, 2020 - proceedings.mlr.press
We introduce a novel state space model for a set of sequentially time-stamped partial
rankings of items and textual descriptions for the items. Based on the data, the model infers …