作者
Xiang Zhang, Jonathan Tong, Nishant Vishwamitra, Elizabeth Whittaker, Joseph P Mazer, Robin Kowalski, Hongxin Hu, Feng Luo, Jamie Macbeth, Edward Dillon
发表日期
2016/12/18
研讨会论文
2016 15th IEEE international conference on machine learning and applications (ICMLA)
页码范围
740-745
出版商
IEEE
简介
Cyberbullying can have a deep and long lasting impact on its victims, who are often adolescents. Accurately detecting cyberbullying helps prevent it. However, the noise and errors in social media posts and messages make detecting cyberbullying very challenging. In this paper, we propose a novel pronunciation based convolutional neural network (PCNN) to address this challenge. Upon observing that the pronunciation of misspelled words in informal online conversations is often unchanged, we used the phoneme codes of the text as the features for a convolutional neural network. This procedure corrects spelling errors that did not alter the pronunciation, thereby alleviating the problem of noise and bullying data sparsity. To overcome class imbalance, a common problem in cyberbullying datasets, we implement three techniques that include threshold-moving, cost function adjusting, and a hybrid solution in our …
引用总数
201720182019202020212022202320242612132726367
学术搜索中的文章
X Zhang, J Tong, N Vishwamitra, E Whittaker, JP Mazer… - 2016 15th IEEE international conference on machine …, 2016