Sentiment analysis of blogs by combining lexical knowledge with text classification

P Melville, W Gryc, RD Lawrence - Proceedings of the 15th ACM …, 2009 - dl.acm.org
Proceedings of the 15th ACM SIGKDD international conference on Knowledge …, 2009 - dl.acm.org
The explosion of user-generated content on the Web has led to new opportunities and significant challenges for companies that are increasingly concerned about monitoring the discussion around their products. Tracking such discussion on weblogs provides useful insight into how to improve products or market them more effectively. An important component of such analysis is to characterize the sentiment expressed in blogs about specific brands and products. Sentiment analysis focuses on this task of automatically identifying whether a piece of text expresses a positive or negative opinion about the subject matter. Most previous work in this area uses prior lexical knowledge in the form of the sentiment polarity of words. In contrast, some recent approaches treat the task as a text classification problem, learning to classify sentiment based only on labeled training data. In this paper, we present a unified framework in which one can use background lexical information in terms of word-class associations, and refine this information for specific domains using any available training examples. Empirical results on diverse domains show that our approach outperforms both the use of background knowledge or training data in isolation and alternative approaches to combining lexical knowledge with text classification.
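
The abstract does not spell out how lexical knowledge and training data are combined, so the following is only an illustrative sketch of one plausible way to do it, not the paper's implementation. It builds one word-given-class distribution from a sentiment lexicon and another from labeled examples, mixes them linearly, and classifies with a Naive Bayes style score under a uniform class prior. The lexicon, the `strength` and `alpha` parameters, the function names, and the toy training data are all hypothetical.

```python
import math
from collections import Counter

CLASSES = ("pos", "neg")
# Hypothetical toy lexicon (word -> class); a real lexicon would be far larger.
LEXICON = {"great": "pos", "love": "pos", "awful": "neg", "hate": "neg"}


def lexicon_model(vocab, lexicon, strength=4.0):
    """P(word | class) from the lexicon alone (assumed weighting scheme)."""
    model = {}
    for c in CLASSES:
        weights = {}
        for w in vocab:
            if w not in lexicon:
                weights[w] = 1.0             # neutral word
            elif lexicon[w] == c:
                weights[w] = strength        # word associated with this class
            else:
                weights[w] = 1.0 / strength  # word associated with the other class
        total = sum(weights.values())
        model[c] = {w: weights[w] / total for w in vocab}
    return model


def trained_model(vocab, docs):
    """Laplace-smoothed P(word | class) from labeled (tokens, label) pairs."""
    counts = {c: Counter() for c in CLASSES}
    for tokens, label in docs:
        counts[label].update(t for t in tokens if t in vocab)
    model = {}
    for c in CLASSES:
        total = sum(counts[c].values()) + len(vocab)
        model[c] = {w: (counts[c][w] + 1) / total for w in vocab}
    return model


def pooled_model(lex, trained, alpha=0.5):
    """Linear mix of the two models; alpha is the weight on the lexicon."""
    return {
        c: {w: alpha * lex[c][w] + (1 - alpha) * trained[c][w] for w in lex[c]}
        for c in CLASSES
    }


def classify(tokens, model):
    """Pick the class with the highest log-likelihood; unknown words are skipped."""
    scores = {
        c: sum(math.log(model[c][t]) for t in tokens if t in model[c])
        for c in CLASSES
    }
    return max(scores, key=scores.get)


# Hypothetical domain-specific training examples.
train = [
    ("battery life is solid".split(), "pos"),
    ("screen is dim and battery drains".split(), "neg"),
]
vocab = set(LEXICON) | {t for tokens, _ in train for t in tokens}
lex = lexicon_model(vocab, LEXICON)
trained = trained_model(vocab, train)
model = pooled_model(lex, trained)

print(classify("hate the dim screen".split(), model))
```

In this toy setup the word "awful" never appears in the training examples, so the trained model alone has no real evidence about it, while the pooled model inherits its polarity from the lexicon. Words such as "solid" and "dim", which the lexicon does not cover, contribute only through the training data.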