作者
Abhinay Pandya, Mourad Oussalah, Paola Monachesi, Panos Kostakos, Lauri Lovén
发表日期
2018/7/6
研讨会论文
2018 ieee international conference on information reuse and integration (iri)
页码范围
62-69
出版商
Ieee
简介
Social media data represent an important resource for behavioral analysis of the ageing population. This paper addresses the problem of age prediction from Twitter dataset, where the prediction issue is viewed as a classification task. For this purpose, an innovative model based on Convolutional Neural Network is devised. To this end, we rely on language-related features and social media specific metadata. More specifically, we introduce two features that have not been previously considered in the literature: the content of URLs and hashtags appearing in tweets. We also employ distributed representations of words and phrases present in tweets, hashtags and URLs, pre-trained on appropriate corpora in order to exploit their semantic information in age prediction. We show that our CNN-based classifier, when compared with an SVM baseline model, yields an improvement of 12.3% and 6.6% in the micro …
引用总数
201920202021202220232024162323
学术搜索中的文章
A Pandya, M Oussalah, P Monachesi, P Kostakos… - 2018 ieee international conference on information …, 2018