Topic modeling using latent Dirichlet allocation: A survey

U Chauhan, A Shah - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
We are not able to deal with a mammoth text corpus without summarizing them into a
relatively small subset. A computational tool is extremely needed to understand such a …

Content analysis of e-petitions with topic modeling: How to train and evaluate LDA models?

L Hagen - Information Processing & Management, 2018 - Elsevier
E-petitions have become a popular vehicle for political activism, but studying them has been
difficult because efficient methods for analyzing their content are currently lacking …

Design and evaluation of a parallel document clustering algorithm based on hierarchical latent semantic analysis

K Seshadri, KV Iyer - Concurrency and Computation: Practice …, 2019 - Wiley Online Library
We propose a parallel generalization scheme for Singular Value Decomposition–based
clustering algorithms. The scheme enables the clustering algorithm to generate a hierarchy …

Supervised probabilistic latent semantic analysis with applications to controversy analysis of legislative bills

E Alemayehu, Y Fang - Intelligent Data Analysis, 2024 - content.iospress.com
Abstract Probabilistic Latent Semantic Analysis (PLSA) is a fundamental text analysis
technique that models each word in a document as a sample from a mixture of topics. PLSA …

The Software for Identifying Technological Complementarity Between Enterprises Based on Patent Databases

A Bezruchenko, D Korobkin, S Fomenkov… - Creativity in Intelligent …, 2021 - Springer
In this paper, it is proposed to identify the technological complementarity of enterprises. The
process of identifying potential partners is based on the comparison of cluster information …

What quality signifies to the Big Data and Machine Learning industry?

KV Iyer - Available at SSRN 4698217, 2024 - papers.ssrn.com
A limited survey of ML techniques viewed from quality of output as implied by embedded
and explicit metrics (eg, optimization in LP, sensitivity analysis vs. possible binary …

A quality criteria based evaluation of topic models

VR Sathi, JS Ramanujapura - 2016 - diva-portal.org
Objectives. In our study, we provide an overview of the amount of research that has been
done in relation to topic models. We want to uncover various quality criteria, evaluation …

Research and improvement of k-means parallel multi-association clustering algorithm

S Huang, B Zhang - Proceedings of the 2020 International Conference …, 2020 - dl.acm.org
In this paper, k-means parallel clustering algorithm is studied. Firstly, this paper introduces
the purpose and significance of k-means clustering algorithm. Secondly, we describe the …

[PDF][PDF] MapReduce Parallel Implementation of Improved K-means Clustering Algorithm on Spark Platform

H Suyu, T Lingli - 2019 - webofproceedings.org
Cloud Computing is the development of Distributed Computing, Parallel Computing and
Grid Computing. Cloud computing is a new distributed parallel computing environment or …

[PDF][PDF] OPTIMIZING TOPIC COHERENCE IN THE GUJARATI TEXT TOPIC MODELING: A RELEVANT WORDS BASED APPROACH

CU Ghanshyambhai, A Shah - 2018 - gtusitecirculars.s3.amazonaws.com
Topic models have gained extensive consideration from the information retrieval research
community. As a result, the variety of extensions of topics modelling techniques have been …