Feature selection for text classification: A review

X Deng, Y Li, J Weng, J Zhang - Multimedia Tools and Applications, 2019 - Springer
Big multimedia data is heterogeneous in essence, that is, the data may be a mixture of
video, audio, text, and images. This is due to the prevalence of novel applications in recent …

Machine learning in automated text categorization

F Sebastiani - ACM computing surveys (CSUR), 2002 - dl.acm.org
The automated categorization (or classification) of texts into predefined categories has
witnessed a booming interest in the last 10 years, due to the increased availability of …

A survey of text classification algorithms

CC Aggarwal, CX Zhai - Mining text data, 2012 - Springer
The problem of classification has been widely studied in the data mining, machine learning,
database, and information retrieval communities with applications in a number of diverse …

A two-stage feature selection method for text categorization by using information gain, principal component analysis and genetic algorithm

H Uğuz - Knowledge-Based Systems, 2011 - Elsevier
Text categorization is widely used when organizing documents in a digital form. Due to the
increasing number of documents in digital form, automated text categorization has become …

[HTML][HTML] An efficient instance selection algorithm to reconstruct training set for support vector machine

C Liu, W Wang, M Wang, F Lv, M Konan - Knowledge-Based Systems, 2017 - Elsevier
Support vector machine is a classification model which has been widely used in many
nonlinear and high dimensional pattern recognition problems. However, it is inefficient or …

A machine learning approach to web page filtering using content and structure analysis

M Chau, H Chen - Decision Support Systems, 2008 - Elsevier
As the Web continues to grow, it has become increasingly difficult to search for relevant
information using traditional search engines. Topic-specific search engines provide an …

[HTML][HTML] Stock market prediction using Firefly algorithm with evolutionary framework optimized feature reduction for OSELM method

SR Das, D Mishra, M Rout - Expert Systems with Applications: X, 2019 - Elsevier
Forecasting future trends of the stock market using the historical data is the exigent demand
in the field of academia as well as business. This work has explored the feature optimization …

[PDF][PDF] Random forest approach fo sentiment analysis in indonesian

MA Fauzi - Indones. J. Electr. Eng. Comput. Sci, 2018 - researchgate.net
Sentiment analysis becomes very useful since the rise of social media and online review
website and, thus, the requirement of analyzing their sentiment in an effective and efficient …

[PDF][PDF] A learner-independent evaluation of the usefulness of statistical phrases for automated text categorization

MF Caropreso, S Matwin, F Sebastiani - Text databases and …, 2001 - nmis.isti.cnr.it
In this work we investigate the usefulness of n-grams for document indexing in text
categorization (TC). We call n-gram a set gk of n word stems, and we say that gk occurs in a …

[图书][B] Opening the black box-data driven visualization of neural networks

FY Tzeng, KL Ma - 2005 - ieeexplore.ieee.org
Artificial neural networks are computer software or hardware models inspired by the
structure and behavior of neurons in the human nervous system. As a powerful learning tool …