[HTML][HTML] A recent overview of the state-of-the-art elements of text classification

MM Mirończuk, J Protasiewicz - Expert Systems with Applications, 2018 - Elsevier
The aim of this study is to provide an overview the state-of-the-art elements of text
classification. For this purpose, we first select and investigate the primary and recent studies …

Feature selection for text classification: A review

X Deng, Y Li, J Weng, J Zhang - Multimedia Tools and Applications, 2019 - Springer
Big multimedia data is heterogeneous in essence, that is, the data may be a mixture of
video, audio, text, and images. This is due to the prevalence of novel applications in recent …

[PDF][PDF] 基于机器学习的文本分类技术研究进展

苏金树, 张博锋, 徐昕[1 - 软件学报, 2006 - Citeseer
文本自动分类是信息检索与数据挖掘领域的研究热点与核心技术, 近年来得到了广泛的关注和
快速的发展. 提出了基于机器学习的文本分类技术所面临的互联网内容信息处理等复杂应用的 …

An analysis of hierarchical text classification using word embeddings

RA Stein, PA Jaques, JF Valiati - Information Sciences, 2019 - Elsevier
Efficient distributed numerical word representation models (word embeddings) combined
with modern machine learning algorithms have recently yielded considerable improvement …

A survey of hierarchical classification across different application domains

CN Silla, AA Freitas - Data mining and knowledge discovery, 2011 - Springer
In this survey we discuss the task of hierarchical classification. The literature about this field
is scattered across very different application domains and for that reason research in one …

Hierarchical multi-label text classification: An attention-based recurrent network approach

W Huang, E Chen, Q Liu, Y Chen, Z Huang… - Proceedings of the 28th …, 2019 - dl.acm.org
Hierarchical multi-label text classification (HMTC) is a fundamental but challenging task of
numerous applications (eg, patent annotation), where documents are assigned to multiple …

An integrated text analytic framework for product defect discovery

AS Abrahams, W Fan, GA Wang… - Production and …, 2015 - journals.sagepub.com
The recent surge in the usage of social media has created an enormous amount of user‐
generated content (UGC). While there are streams of research that seek to mine UGC, these …

An improved K-nearest-neighbor algorithm for text categorization

S Jiang, G Pang, M Wu, L Kuang - Expert Systems with Applications, 2012 - Elsevier
Text categorization is a significant tool to manage and organize the surging text data. Many
text categorization algorithms have been explored in previous literatures, such as KNN …

Demystifying softmax gating function in Gaussian mixture of experts

H Nguyen, TT Nguyen, N Ho - Advances in Neural …, 2024 - proceedings.neurips.cc
Understanding the parameter estimation of softmax gating Gaussian mixture of experts has
remained a long-standing open problem in the literature. It is mainly due to three …

Hierarchical document categorization with support vector machines

L Cai, T Hofmann - Proceedings of the thirteenth ACM international …, 2004 - dl.acm.org
Automatically categorizing documents into pre-defined topic hierarchies or taxonomies is a
crucial step in knowledge and content management. Standard machine learning techniques …