作者
Yanwei Bao, Changqin Quan, Lijuan Wang, Fuji Ren
发表日期
2014
研讨会论文
Intelligent Computing Methodologies: 10th International Conference, ICIC 2014, Taiyuan, China, August 3-6, 2014. Proceedings 10
页码范围
615-624
出版商
Springer International Publishing
简介
Recently, increasing attention has been attracted to Social Networking Sentiment Analysis. Twitter as one of the most fashional social networking platforms has been researched as a hot topic in this domain. Normally, sentiment analysis is regarded as a classification problem. Training a classifier with tweets data, there is a large amount of noise due to tweets’ shortness, marks, irregular words etc. In this work we explore the impact pre-processing methods make on twitter sentiment classification. We evaluate the effects of URLs, negation, repeated letters, stemming and lemmatization. Experimental results on the Stanford Twitter Sentiment Dataset show that sentiment classification accuracy rises when URLs features reservation, negation transformation and repeated letters normalization are employed while descends when stemming and lemmatization are applied. Moreover, we get a better result by …
引用总数
201320142015201620172018201920202021202220232024144111614161719147
学术搜索中的文章
Y Bao, C Quan, L Wang, F Ren - … : 10th International Conference, ICIC 2014, Taiyuan …, 2014