A survey of text classification algorithms

CC Aggarwal, CX Zhai - Mining text data, 2012 - Springer
The problem of classification has been widely studied in the data mining, machine learning,
database, and information retrieval communities with applications in a number of diverse …

Large-scale Bayesian logistic regression for text categorization

A Genkin, DD Lewis, D Madigan - technometrics, 2007 - Taylor & Francis
Logistic regression analysis of high-dimensional data, such as natural language text, poses
computational and statistical challenges. Maximum likelihood estimation often fails in these …

Sentiment analysis of blogs by combining lexical knowledge with text classification

P Melville, W Gryc, RD Lawrence - Proceedings of the 15th ACM …, 2009 - dl.acm.org
The explosion of user-generated content on the Web has led to new opportunities and
significant challenges for companies, that are increasingly concerned about monitoring the …

Learning from labeled features using generalized expectation criteria

G Druck, G Mann, A McCallum - … of the 31st annual international ACM …, 2008 - dl.acm.org
It is difficult to apply machine learning to new domains because often we lack labeled
problem instances. In this paper, we provide a solution to this problem that leverages …

Document-word co-regularization for semi-supervised sentiment analysis

V Sindhwani, P Melville - 2008 Eighth ieee international …, 2008 - ieeexplore.ieee.org
The goal of sentiment prediction is to automatically identify whether a given piece of text
expresses positive or negative opinion towards a topic of interest. One can pose sentiment …

Efficient bayesian hierarchical user modeling for recommendation system

Y Zhang, J Koren - Proceedings of the 30th annual international ACM …, 2007 - dl.acm.org
A content-based personalized recommendation system learns user specific profiles from
user feedback so that it can deliver information tailored to each individual user's interest. A …

Manifold adaptive experimental design for text categorization

D Cai, X He - IEEE Transactions on Knowledge and Data …, 2011 - ieeexplore.ieee.org
In many information processing tasks, labels are usually expensive and the unlabeled data
points are abundant. To reduce the cost on collecting labels, it is crucial to predict which …

Class imbalance and active learning

J Attenberg, Ş Ertekin - Imbalanced Learning: Foundations …, 2013 - Wiley Online Library
This chapter focuses on the interaction between active learning (AL) and class imbalance,
discussing (i) AL techniques designed specifically for dealing with imbalanced settings,(ii) …

[PDF][PDF] A statistical comparison of logistic regression and different Bayes classification methods for machine learning

LM Gladence, M Karthi, VM Anu - ARPN Journal of Engineering …, 2015 - researchgate.net
ABSTRACT Recent Machine Learning algorithms are widely available for various purposes.
But which classifier is suitable for particular data is not yet defined. To consider this into …

An interactive algorithm for asking and incorporating feature feedback into support vector machines

H Raghavan, J Allan - Proceedings of the 30th annual international …, 2007 - dl.acm.org
Standard machine learning techniques typically require ample training data in the form of
labeled instances. In many situations it may be too tedious or costly to obtain sufficient …