Topic modeling algorithms and applications: A survey

A Abdelrazek, Y Eid, E Gawish, W Medhat, A Hassan - Information Systems, 2023 - Elsevier
Topic modeling is used in information retrieval to infer the hidden themes in a collection of
documents and thus provides an automatic means to organize, understand and summarize …

A survey of knowledge-enhanced text generation

W Yu, C Zhu, Z Li, Z Hu, Q Wang, H Ji… - ACM Computing …, 2022 - dl.acm.org
The goal of text-to-text generation is to make machines express like a human in many
applications such as conversation, summarization, and translation. It is one of the most …

BERTopic: Neural topic modeling with a class-based TF-IDF procedure

M Grootendorst - arXiv preprint arXiv:2203.05794, 2022 - arxiv.org
Topic models can be useful tools to discover latent topics in collections of documents.
Recent studies have shown the feasibility of approach topic modeling as a clustering task …

Short text topic modeling techniques, applications, and performance: a survey

J Qiang, Z Qian, Y Li, Y Yuan… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Analyzing short texts infers discriminative and coherent latent topics that is a critical and
fundamental task since many real-world applications require semantic understanding of …

Topic modelling meets deep neural networks: A survey

H Zhao, D Phung, V Huynh, Y Jin, L Du… - arXiv preprint arXiv …, 2021 - arxiv.org
Topic modelling has been a successful technique for text analysis for almost twenty years.
When topic modelling met deep neural networks, there emerged a new and increasingly …

[HTML][HTML] A clinical text classification paradigm using weak supervision and deep representation

Y Wang, S Sohn, S Liu, F Shen, L Wang… - BMC medical informatics …, 2019 - Springer
Background Automatic clinical text classification is a natural language processing (NLP)
technology that unlocks information embedded in clinical narratives. Machine learning …

Semantic text classification: A survey of past and recent advances

B Altınel, MC Ganiz - Information Processing & Management, 2018 - Elsevier
Automatic text classification is the task of organizing documents into pre-determined classes,
generally using machine learning algorithms. Generally speaking, it is one of the most …

Improving topic models with latent feature word representations

DQ Nguyen, R Billingsley, L Du… - Transactions of the …, 2015 - direct.mit.edu
Probabilistic topic models are widely used to discover latent topics in document collections,
while latent feature vector representations of words have been used to obtain high …

Topic memory networks for short text classification

J Zeng, J Li, Y Song, C Gao, MR Lyu, I King - arXiv preprint arXiv …, 2018 - arxiv.org
Many classification models work poorly on short texts due to data sparsity. To address this
issue, we propose topic memory networks for short text classification with a novel topic …

[HTML][HTML] A review of text corpus-based tourism big data mining

Q Li, S Li, S Zhang, J Hu, J Hu - Applied Sciences, 2019 - mdpi.com
With the massive growth of the Internet, text data has become one of the main formats of
tourism big data. As an effective expression means of tourists' opinions, text mining of such …