
Authors

Joni Salminen, Maximilian Hopf, Shammur A Chowdhury, Soon-gyo Jung, Hind Almerekhi, Bernard J Jansen

Publication date

2020/12

Journal

Human-centric Computing and Information Sciences

Volume

Pages

1-34

Publisher

Springer Berlin Heidelberg

Description

The proliferation of social media enables people to express their opinions widely online. However, at the same time, this has resulted in the emergence of conflict and hate, making online environments uninviting for users. Although researchers have found that hate is a problem across multiple platforms, there is a lack of models for online hate detection using multi-platform data. To address this research gap, we collect a total of 197,566 comments from four platforms: YouTube, Reddit, Wikipedia, and Twitter, with 80% of the comments labeled as non-hateful and the remaining 20% labeled as hateful. We then experiment with several classification algorithms (Logistic Regression, Naïve Bayes, Support Vector Machines, XGBoost, and Neural Networks) and feature representations (Bag-of-Words, TF-IDF, Word2Vec, BERT, and their combination). While all the models significantly outperform the keyword-based …
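
As a rough illustration of the kind of comparison the abstract describes, the sketch below pairs a keyword-based baseline with a Bag-of-Words Naïve Bayes classifier in plain Python. It is not the authors' implementation: the toy comments, the keyword list, and every function name (`keyword_baseline`, `train_naive_bayes`, `predict`) are hypothetical, and Laplace smoothing with `alpha=1.0` is an assumed default.

```python
import math
from collections import Counter

# Invented toy comments for illustration only; NOT from the paper's dataset.
# Label 0 = non-hateful, label 1 = hateful.
TRAIN = [
    ("you are a wonderful person", 0),
    ("thanks for sharing this video", 0),
    ("great point i agree with you", 0),
    ("i hate you and your stupid people", 1),
    ("those people are stupid and disgusting", 1),
    ("get out of here you disgusting idiot", 1),
]

# Hypothetical keyword list for the baseline (an assumption, not from the source).
HATE_KEYWORDS = {"hate", "idiot"}


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately crude tokenizer)."""
    return text.lower().split()


def keyword_baseline(text):
    """Flag a comment as hateful (1) if it contains any listed keyword."""
    return int(any(tok in HATE_KEYWORDS for tok in tokenize(text)))


def train_naive_bayes(examples, alpha=1.0):
    """Collect per-class word counts for a multinomial Naive Bayes model."""
    word_counts = {0: Counter(), 1: Counter()}
    doc_counts = Counter()
    for text, label in examples:
        doc_counts[label] += 1
        word_counts[label].update(tokenize(text))
    vocab = set(word_counts[0]) | set(word_counts[1])
    return {"word_counts": word_counts, "doc_counts": doc_counts,
            "vocab": vocab, "alpha": alpha}


def predict(model, text):
    """Return the class with the highest log-posterior under the model."""
    total_docs = sum(model["doc_counts"].values())
    vocab_size = len(model["vocab"])
    alpha = model["alpha"]
    scores = {}
    for label in (0, 1):
        counts = model["word_counts"][label]
        total_words = sum(counts.values())
        score = math.log(model["doc_counts"][label] / total_docs)
        for tok in tokenize(text):
            if tok in model["vocab"]:  # skip words never seen in training
                score += math.log((counts[tok] + alpha)
                                  / (total_words + alpha * vocab_size))
        scores[label] = score
    return max(scores, key=scores.get)


model = train_naive_bayes(TRAIN)
for comment in ["you are stupid and disgusting",
                "thanks for sharing this great video"]:
    print(comment, "| keyword:", keyword_baseline(comment),
          "| naive bayes:", predict(model, comment))
```

On data this small the output only shows the mechanics: a keyword list catches nothing beyond the words it enumerates, while a learned model weighs every token it saw in training. It says nothing about the relative performance reported in the paper.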

Total citations

Cited by 248

Citations per year: 2020: 20; 2021: 59; 2022: 73; 2023: 67; 2024: 25

Scholar articles

Developing an online hate classifier for multiple social media platforms

J Salminen, M Hopf, SA Chowdhury, S Jung… - Human-centric Computing and Information Sciences, 2020
